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Convection and Conduction Cooling of 
Substrates Containing Multiple 
Heat Sources 


By V. L. HEIN 
(Manuscript received April 12, 1967) 


An analysts is made of the steady-state temperature distribution in a 
substrate with heat inputs from multiple sources. The problem is of interest 
an connection with integrated and thin film circuits mounted on ceramic or 
glass substrates. In these applications, convective heat transfer 1s present 
with either conduction along the leads joining the substrate to the heat 
sink or conduction to one end of the substrate which 1s heat sinked. A 
formal three-dimensional solution is obtained which is evaluated for 
various geometries, thermal conductivities, coefficients of convection and 
heat-sinking conditions. 


I. INTRODUCTION 


The remarkable technologies of beam-leaded and thin film inte- 
grated circuits have resulted in a new approach to the physical design 
of electronic circuits.1 This approach as shown in Fig. 1 consists of 
bonding beam-leaded integrated circuits to ceramic or glass substrates 
containing thin film components and conductors. At present, dissipa- 
tion of the thermal energy generated in these circuits is one of the 
most severe limitations and affects both device performance and 
reliability. 

The results of this analysis will provide a better understanding of 
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Fig. 1— Integrated circuit substrate. 


the heat transfer phenomena by showing the effects of substrate area, 
shape, thickness, and thermal conductivity. Included are the effects of 
the source area, the coefficient of convection and two heat-sinking 
conditions. The two heat-sinking conditions are: one edge of the 
substrate is an isothermal boundary, and the leads from the substrate 
are connected to a heat sink. 

Convection is considered on only the two large faces of the sub- 
strate since the area of the sides is much smaller. The coefficients of 
convection are distinguishable for both large faces since some mount- 
ing positions will require that these values be different.” For instance, 
the coefficients of convection are quite different for the top and bottom 
faces of a horizontal plate. 

Although the dimensions of most substrates suggest a two-dimen- 
sional thin plate model, the very small areas of some heat sources 
indicate that large thermal gradients in the direction of the normal 
to the large faces will be present under these sources. This condition 
is also magnified by the poor thermal conductivity of some substrate 
materials. Thus, the solution obtained must be three-dimensional to 
include these effects. 

By superposition, multiple sources are considered and the interac- 
tions between sources are determined. This provides the necessary in- 
formation to design the substrate with the desired isolation between 
temperature sensitive components. 
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II. MATHEMATICAL MODEL AND BOUNDARY CONDITIONS 


The geometry of the problem is shown in Fig. 1 for the case where 
a substrate is connected to a heat sink through leads. The second 
heat-sinking condition to be considered is easily visualized if the 
leads are removed and one edge is considered an isothermal boundary. 
The temperature in a homogeneous solid of thermal conductivity k 
satisfies Poisson’s ag 
ote * oT = Oe Y, 2) . (1) 
where Q is the source ane per unit volume. Since there are 
multiple sources in many of the cases to be considered, it is con- 
venient to use the Green’s function approach. It is easily shown® that 
the formal solution can be expressed as 


PE yee) = fF is [ Gtu,v, w | 2, y,2Q(u, v, w) du dv dw 

















+f fer, “= TG) du do 
te f [orl - 26)” ava 
+h iM GT, = 7G.) dv dw, (2) 








where a, b, and ¢ are the substrate dimensions in the 2, y, and 2 
directions, respectively, and T', denotes dT'/du. The Green’s function 
G (a, y, 2 | u, v, - which is symmetric with G(u, v, w | 2, y, 2), satisfies 








: =O = ea oa 
42 - e438 v6 5(x | u) oy v) 6(z w) (3) 
The boundary se ates for G(x, y, z | u, v, w,) and T(x, y, z) are 
T= G4; =-0 x= a, 
T, =G, =0 y =0 and 8B, 


kT, = hit and kG, = h,G z2=0, 

—kT, =h.T and —kG, = h.G Z2=C, 

and either 
T=G=0 o T7,=G,=0, 2=0. (4) 


The boundary conditions reference the ambient temperature to 
zero, and h, and hp are the coefficients of convection on the two large 


1662 THE BELL SYSTEM TECHNICAL JOURNAL, OCTOBER 1967 


faces of the substrate. The choice of the last boundary condition 
depends on whether the substrate under consideration has an iso- 
thermal boundary on one end or is supported by leads. If the sub- 
strate has leads, the latter boundary condition is used and the heat 
lost by conduction along the leads is considered by assuming negative 
sources? located at the lead bonding areas. The magnitude of these 
sources is determined by noting that the temperature difference be- 
tween the ends of a lead is given by the product of the lead thermal 
resistance and the amount of heat conducted along that lead. 

The case where the leads conduct an appreciable amount of heat 
is somewhat more difficult and cumbersome than the case of an 
isothermal boundary at x equals zero or the case where the lead’s 
thermal resistance is of sufficient magnitude to be approximated by 
an insulated boundary. In what immediately follows, the three cases 
will be treated separately and generally only one source is considered. 
The general case of multiple sources follows by superposition and 
will be treated last. 


2.1 Substrate With High-Resistance Leads 

The solution to this problem is obtained by taking finite cosine trans- 
forms’ of (3) in the x and y directions. The eigenvalues are chosen 
to satisfy the boundary conditions given by (4). Taking the double 
cosine transform gives 


é., Said BG _ —d(z — w) a au cos By (5) 


where a = mr/a (m = 0, 1, 2, ---), 8 = nr/b (n = 0, 1, 2, ---) and 
G denotes the transformed dependent variable G. A third transform 
of (5) is not taken to avoid a triple summation in the final solution. 
Although the triple summation is no great obstacle, it increases the 
number of terms needed for numerical evaluation. Thus, (5) is solved 
for the cases of 


a +6 =0 and a +6 #0. 
If a and 6 are both zero, by standard methods’ the solution to (5), 
satisfying the boundary conditions given by (4) is 


GO, 0,z|u,v, w) = k(ke/hye + k/hse + 1) , 6) 


, _ (k/he — z/e + (w+ k/h) 
CO ie eee tape ee ei 


IA 
s 


IV 
s 
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For a or 8 not equal to zero, the solution of (5) leads to 


Gla, B,z|u,v, w) = cosas f 5(¢ — w) 
Jo 


‘[o()¥@) — 6@¥(H] ds + AG@ + BY, 
where 
$(z) = sinh yz, v(2) = cosh yz 
and 
y= (@ + 6’). (7) 
Again by standard methods, (7) leads to 
Gla, B, 2 | u,v, w) = cos au cos Bolo) + (ky/h)¥@)] 


VOY) — (h./ky)o(w)) — ¢@[¢w)_— (o/ky) vw) 
ky|(he/ky + ky/hi)o( + A + ho/h,) ¥©)) (8) 


Ga, B,z| u,v, w) = cosau cos Bul(w) + (ky/h)W(w)] 


YOW@ — (e/ky)o@] — ¢Ol6@ — (e/ky)¥@] 
key [(he/ley + ky/hi)o©) +L + he/y) YO)’ 


The inversion for the double cosine transforms is easily derived* and 
upon substitution into (2) gives 


¢€ b a 1 s 
L(¢-4,2)- = i [ e G(u, v, w | 0, 0, 2) 


+= > Gu, », w|a,0,z) cosax 





IIA 
& 


IV 


WwW. 


Ae a > Gu, v, w | 0, B, 2) cos By 
ab B 


+ = 3 » Glu, v, w | a, B, 2) cos ax cos sv low. v, w) dududw. (9) 
a 8B 


It should be noted that the surface integrals of (2) are identically 
equal to zero because of the boundary conditions and the subsequent 
choice of eigenvalues. Consequently, these terms do not appear in (9) 
and the determination of the temperature distribution merely requires 
the substitution of the appropriate Green’s functions from (6) and 
(8) and the integrations indicated in (9). For the applications cited, 
the sources are on the z equals zero surface and therefore Q(z, y, 2) = 
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Q(x, y) 8 (z). Substituting this into (9) gives 
T(2e,y,2) = ffls Glu, v, 0 | 0, 0, 2) 


Gu, v, 0 | a, 0, 2) cosar 


si 


ne 


x 
> Guu, v, 0 | 0, 8, 2) cos By 
B 


—» da Gu, v,0 | a, B, 2) cos ax cos ev Jaw, v) du dv. (10) 


Note that in this case Q(u, v) represents real sources since the leads 
are assumed to be of very high resistance and thereby produce 
negligible effects. 


=| 
4 


2.2 Substrate With Heat Conducting Leads 

The method of solution for this case makes use of the results up to 
(10). It has been stated that the leads will be treated as negative 
heat sources and thus by superposition (10) will read 


b a 1 = 
L(G Y32) = [ [ E Gu, v, 0 | 0, 0,2) 
= * > Glu, v, 0 | a, 0, 2) cos ax 
2 = 
ab p» G(u, v, 0 | 0, 8, 2) cos By 


+= ys > Guu, v, 0 | a, B, 2) cosax cos By | 
a 8 


-[Q(u, v) — Flu, v)] du dv, (11) 
where F'(u, v) represents the heat sources due to leads. It is important 
to note that both Q and F can represent any arbitrary number of 
real sources and leads, respectively. This is important since it is quite 
common for a substrate to have as many as sixteen or eighteen leads. 
It is obvious at this point that F(u, v) must be specified. The ap- 
proach used here is to assume a uniform heat source over each lead 
bond area and determine the magnitudes of these sources by using 
other available information. The required information is available 
from (11) since the temperature can be evaluated at each lead loca- 
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tion in terms of each source power density. Thus, 


Fu, v) i fig, v) 5 fogou, v) a te f.gn(u, v) (12) 
represents the sources due to each lead location and f;, fe -°° fa 
are unspecified at this time. The functions g;(u, v) are a combination 
of step functions that give unity at the lead bond area and zero else- 
where. To determine fj, fe, --* fx, the condition that the temperature 
difference between the two ends of a lead is equal to the product of 
the thermal power and thermal resistance of that lead is used. Equat- 
ing these two expressions for the temperature at each lead location 
gives m equations and m unknowns in f;. In matrix notation this can 
be expressed as 


[A + R[F] = [B], (13) 
where A is an n by n matrix representing the influence coefficients due to 
the action of the negative sources. FR is also an n by n matrix but the 
only non-zero elements are those where 1 = j. These elements are the 
thermal resistances of the leads. The column matrix Ff consists of 
fi, fe, °°: f, and represents the unknowns to be determined. The 
matrix B is a column matrix which represents the effects of the real 
heat sources. By substitution of A + R = C, then (13) becomes 


CF = B, 
and, therefore, (14) 
CCE SCE: 
where C is the inverse of C. Since C*C = I, where I is the unit 
matrix, then 


if = 0"'B (15) 
gives the desired results. Using these values of f;, fe, -°* fn in (11) 


permits the calculation of the temperature at any point in the 
substrate. In the results reported here, the temperature was always 
evaluated at the center of the lead bond area and the flux was 
assumed constant over that area. This is a reasonable approximation 
since the lead material generally has a much higher thermal con- 
ductivity than the substrate. However, the problem can be solved 
for other functional representations of the flux if there is evidence that 
these representations are significantly better approximations of the 
physical situations. Similarly, the reported results for substrates with 
leads will be for a single heat source although Q (u, v) and the matrix 
B are not restricted to such situations and can represent any arbitrary 
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number of sources. Thus, the method of obtaining a formal solution for 
the temperature distribution in a substrate containing multiple sources 
and leads has been indicated. 


2.3 Substrate With Isotherm on One Boundary 

The solution of the temperature problem for the x = 0 boundary 
being an isotherm is obtained by taking a finite sine transform‘ in the 
x direction and retaining the cosine transform for the y direction. 
Applying these transforms to (8) gives 


— 6(z — w) sin au cos Bv 
k : 


where a = (2m — 1)r/2a (m = 1, 2,3, ---), 8 = nr/b (n = 0,1, 2, +--+). 
Comparing (16) and (5) indicates the solution to (5) can be used as 
the solution to (16) provided sin au is substituted to replace cos au 
and a is now given by (16). The solution to (5) was given by (6) for 
a’ + 6 = 0 and (8) fora’ + B’ ¥ 0. Since a’ + 8’ is never equal to 
zero in (16), (6) is not needed and (8) gives the solution provided the 
indicated changes are made. The inversion formula for these multiple 
finite transforms gives the temperature distribution as 


b a - 
Te, 4.2) = | [ | & Glu, », 0 | a, 0,2) sin as 


G.,-@4+ 6)G = (16) 


“+ = - Glu, v, 0 | a, B, 2) sin ax cos av Jove v) du dv, (17) 
« 6B 


where again G( u, v, w | 2, y, 2) = G(x, y, 2 | u, v, w). It should be 
noted that in (9), (10), (11), and (16) the summations on a and £ are 
only for the non-zero eigenvalues since the « = 0 or 8 = 0 terms, if 
they appear, are already indicated in the inversion formulas. 

It should be apparent that similar physical situations such as two 
opposite boundaries being isotherms poses no new or additional prob- 
lems. For instance if the boundaries « = 0 and x = a are isotherms, 
(17) is valid provided a = mx/a(m = 1, 2,3,-°°). 


III. NUMERICAL RESULTS 


In this section results are given for the three cases discussed in 
Sections 2.1, 2.2, and 2.3. For the results reported, it is assumed that 
the heat sources have uniform power density. Thus, they are mathe- 
matically represented by a combination of step functions. The results 
are reported in terms of thermal resistance where thermal resistance 
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(R.) is defined as the difference in maximum and minimum tempera- 
tures based on one watt of power dissipation. Dimensionless variables 
are used to present the results in their most general form except for 
several instances where specific results are desired. The dimensionless 
variables are Az/a, %/a, c/a, Ay/b, yo/b, c/b, Az/c, 2o/c, hic/k, and 
hoc/k, where Ax, Ay, and Az are the source dimensions and 2o, yo, and Zo 
the coordinates of the source center. For the applications cited, the 
sources are always plane sources located on one face of the substrate 
and thus Az/c = o/c = 0 in (10), (11), and (17) and consequently, in 
all the results. 

Previously, it was suggested that a three-dimensional solution would 
be necessary to accurately represent the thermal behavior of a low 
thermal conductivity substrate containing very small heat sources. 
This is because the temperature gradient in a direction normal to a 
plane source must increase at the same rate as the source area decreases 
if the same amount of heat is dissipated. Thus, small sources require 
large gradients near the source even though the body temperature 
may be nearly uniform elsewhere as would be expected if the Biot 
number is small, i.e., he/k « 1. The following physical parameters 
are chosen as a typical example of a substrate with a small centrally 
located heat source: 


a/c = 26.0, b/c = 10.0, 
%o/a = 0.50, Yo/b = 0.50, 
Ax/a = 0.0123,  Ay/b = 0.0320 
hic/k = 0.93 X 10°,  hec/k = 0.93 X 107°. 


Fig. 2 presents the results where the dependent variable is the dimen- 
sionless quantity ckT (x, y, 2). These results clearly indicate the prob- 
lem is three dimensional near the source due to spreading resistance 
and thus a three-dimension solution is required to accurately describe 
substrates containing small sources. 

The next numerical results are for square and rectangular substrates 
containing one centrally located heat source and leads that conduct a 
negligible amount of heat. Thus, the leads can be ignored in the anal- 
ysis and the equations of Section 2.1 are applicable. Obviously, this 
model always gives an upper bound for the substrate thermal resist- 
ance unless the leads are heat sinked at a higher temperature than the 
atmosphere surrounding the substrate. Except for this unlikely situ- 
ation, these results give an easy-to-obtain first approximation of the 
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REAR FACE 


Fig. 2— Integrated circuit substrate temperature, ckT(x, y, z) as a function of 
position: a/c = 26.0, b/c = 10.0, 29/a = yo/b = 0.50, Ax/a = 0.0123, Ay/b = 0.0320, 
hic/k = hee/k = 0.93 X 10-3, P = 1 watt. 
substrate thermal capability and are given in Fig. 3 (a) and (b). 
Each solid line represents a given substrate whereas each broken line 
represents a given source. Although the broken lines represent redun- 
dant information, they assist in illustrating the effects of various pa- 
rameters. Thus, Fig. 3(a) and (b) clearly show the effects of heat 
source and substrate areas. 

Results for the case where one end of the substrate is an isothermal 
boundary are given in Fig. 3(c) and (d). The equation for these re- 
sults was developed in Section 2.3. By following one of the broken lines, 
a constant heat source area is being maintained and the effects of 
changes in the substrate area can be observed. It will be noted that 
increasing the substrate area provides little benefit since, with a cen- 
rally located source, increases in the thermal resistance due to longer 
conduction paths almost cancel the decreases due to larger convection 
areas. As with the previous case, it is noted that the square substrate 
is slightly more efficient than the rectangular one. The Biot number 
used for the results presented in Fig. 3(a) thru (d) is a typical value 
for a thin alumina ceramic substrate with free convection from both 
faces. . 

Results for a substrate with heat conducting leads are more difficult 
to generalize since the thermal resistance of the leads, the lead bond 
areas and locations, and the number of leads are parameters which 
affect the results. To illustrate the effects of leads, two alumina ceramic 
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substrates containing beam-leaded monolithic integrated circuits are 
considered. These substrates will be identified as Substrates I and 
II, and are very similar to the one in Fig. 1 except that both have 
one heat source. Substrate I is supported by sixteen copper leads, 
but due to symmetry only one quadrant of the substrate need be con- 
sidered. Thus, the problem is simplified to a substrate containing one 
source and four leads. Using the procedure outlined in Section 2.2 gives 


608.7 500.3 476.6 466.0 
500.3 585.1 489.7 475.2 
476.6 489.7 583.7 497.1 
466.0 475.2 497.1 602.0 





fe) 
10° 10! 102 103 104 10° 10! 102 103 104 
ab/AxAy ab/AxdAy 


Fig. 3—Substrate thermal resistance: (a) and (c) a/e = b/c, At/a = Ay/b, 
Xo/a = yo/b = 0.50, Aic/k = hec/k = 0.93 XK 10-3; (b) ae 3 a/c = 2b/c, Ax/a = 
Ay/2b, xo/a = yo/2b = 0.50, hic/k = hec/k = 0.93 X 1 
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and 


[124.2 
pa | 1213 | (20) 
118.1 


L116.1 


where the total output from the heat source has been taken as one- 
fourth watt since only one-fourth of the real heat source lies in the 
quadrant being considered. Lead spacing was 0.190 cm (0.075 inch), 
and each lead bond area was 0.038 cm (0.015 inch) by 0.038 cm. To 
continue the analysis and determine the f;’s, it is necessary to specify 
the thermal resistances of the leads as these values are the diagonal 
elements of the matrix R. To obtain a better understanding of the ef- 
fects of the leads, it is useful to present the substrate thermal resist- 
ance as a function of the lead thermal resistance, These results are 
given in Fig. 4 and clearly indicate that the substrate thermal resist- 
ance can be substantially reduced by heat sinking the leads. Fig. 4 
also gives results for 0.107 by 0.107, 0.157 by 0.157, and 0.208 by 0.208 
cm square (0.042 by 0.042, 0.062 by 0.062, and 0.082 by 0.082 inches 
square, respectively) sources to illustrate the effects of source size for 
substrates with leads. For these physical situations, the change in 
thermal resistance due to changes in source size is within 1°C/watt of 





Rt=149 °C/WATT AS Ry co 


0.107 CM 


0.157CM 


Ry IN °C/WATT 


0.208 CM 





Oo 300 600 900 1200 
LEAD RESISTANCE , Ry IN °C/WATT 


Fig. 4— Thermal resistance of Substrate I: a = 1.61 cm, b = 0.89 em, ¢ = 0.0635 
em, Ar = Ay, k = 0.202 watt/em-°C, hi = hz = 0.003 watt/cem?-°C. 
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Fig. 5— Thermal resistance of Substrate II: a = 3.30 cm, b = 0.89 em, ¢ = 0.0635 
cm, Ar = Ay, k = 0.202 watt/em-°C, hi = he = 0.003 watt/cm?-°C. 


the results obtained for substrates without leads. This indicates that 
for small sources the source area effects are local and the effects of 
moderate changes in source area can be approximated by the results 
for substrates without heat conducting leads. 

The results for Substrate II are given in Fig. 5. This design is very 
similar to the previous one except the substrate is approximately twice 
as long, contains eighteen leads on 0.381 cm (0.150 inch) spacing and 
each lead bond area is 0.038 cm (0.015 inch) by 0.076 cm (0.030 inch). 
These results indicate a thermal resistance of 80°C/watt for this sub- 
strate as compared to 149°C/watt for the previous one if the effects of 
the leads are not included. These substrate resistances are reduced to 
45 and 62°C/watt, respectively, if the thermal resistance of each lead 
is 600°C/watt. To illustrate the effects of the leads, Fig. 6 shows the 
percent reduction in the substrate thermal resistance due to each lead. 
In this figure the leads are numbered beginning with the lead closest 
to the heat source. Although results for these two designs do not permit 
a generalization of lead effects, they do illustrate what effects may be 
expected for geometries that are reasonably similar. 

This significant effect of the leads illustrates one of the disadvantages 
of glass and glazed ceramic substrates. At present most leads bonded 
to these substrates have had much higher thermal resistances than 
those bonded to unglazed ceramics. This is because of the inability to 
repeatedly bond thick copper leads to glass or glazed substrates with- 
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Fig. 6 — Percent decrease in substrate thermal resistance due to leads. 


out fractures at the time of bonding or upon subsequent thermal 
cycling. Thus, the effect of replacing thick copper leads with thin gold 
ribbon, as is commonly used, can easily result in an additional thermal 
resistance of 20 to 40°C/watt. These results also suggest that when 
necessary, ceramic substrates could be glazed in only those areas as 
required for component performance and thus permit the use of low 
thermal resistance leads. 

A substrate dimension of obvious interest is the thickness. Results 
considering this parameter are given in Fig. 7(a) and (b) for the 
indicated rectangular substrates. For the substrate without leads, the 
change in thermal resistance due to a 40 percent increase in substrate 
thickness is insignificant whereas for the substrate with an isothermal 
boundary on one edge a 20 percent improvement in thermal resistance 
is possible for the small substrate areas. These conclusions are made 
by comparing the results of Fig. 3(b) and (d) with Fig. 7(a) and (b), 
where again the Biot number has been chosen to represent a thin 
alumina ceramic substrate with free convection from both faces. 

The coefficient of convection and the substrate thermal conductivity 
are also parameters that have significant effects on substrate heat 
transfer characteristics. If the dependent variable is chosen as ckRz, it 
is possible to consider the effects of changes in the coefficient of con- 
vection and the substrate thermal conductivity by considering various 
values of the Biot number since these two parameters always appear 
in this non-dimensional form. However, for the applications cited the 
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thermal conductivities are approximately 0.202 watt/em-°C for 
alumina ceramic, 1.35 watt/em-°C for beryllia ceramic and 0.020 
watt/em-°C for glass while the coefficients of convection range from 
0.003 to 0.010 watt/cm?-°C. These coefficients of convection represent 
the range from free to very moderate forced convection. Since this 
range is much smaller than the range of thermal conductivities, the 
results from the evaluation of these two parameters are presented 
separately. To illustrate the effects of the coefficient of convection, a 
rectangular substrate without leads is chosen and the results are 
presented in Fig. 8. As specific examples, the Biot numbers chosen 
can represent a thin alumina ceramic with free convection in equip- 
ment, free convection under laboratory conditions with small substrates 
and moderate forced convection. These results illustrate the need for 
adequate air volume and velocity around the substrate since the coeffi- 
cient of convection? is dependent upon both. 

All previous results were for Biot numbers in a range applicable to 
thin alumina ceramic substrates subjected to free or moderate forced 
convection. The following results will be applicable for beryllia 
ceramic and glass substrates under the same conditions. Since typical 
glasses used for thin film substrates have thermal conductivities ten 
to twenty times smaller than those of alumina ceramics, the results are 
obviously less favorable. However, if the average temperature term of 
(10) is considered, it will be noted that this term is most strongly 
influenced by the reciprocal of the product of the coefficient of con- 


ae 
g 





10° 10! 10% 10 104 10° 10! 102 10° 10% 
ab/fAxdAy ab /AxAy 


Fig. 7— Substrate thermal resistance: ae a AY a/e = 2b/c, At/a = Ay/2b, 
Xo/4 = Yo/b = 0.50, hic/k = hec/k = 
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—————} 
10° 10! 102 103 104 
ab/AwAy 
Fig. 8—Substrate thermal resistance for several Biot numbers: a/c = 2b/c, 
Az/a = Ay/2b, xo/a = yo/b = 0.50, hic/k = hec/k; (D) hic/k = 0.93 X 1073; (2) 
hic/k = 1.6 X 1073; (F) hic/k = 3.2 X 10-3, 


vection and the substrate area. Thus, the substrate thermal conduc- 
tivity has a minor effect on this term. As the source area-to-substrate 
area ratio approaches unity, this term is the significant term in the 
answer since the substrate begins to approach a uniform temperature 
which is also the average temperature. Thus, the conclusion can be 
made that if the source area is approximately equal to the substrate 





10° 10! 102 103 10% 
ab/AxAy 


Fig. 9— Thermal resistance of glass, alumina and beryllia ceramic substrates: 
a = b = 2.54 cm, c = 0.0635 cm, Ar/a = Ay/b, xo/a = yo/b = 0.50, hi = he = 0.003 
watt/em?-°C; (D) k = 0.020 watt/em-°C; (12) k = 0.202 watt/em-°C; (F) k = 1.35 
watt/em-°C. 
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area, the effects of low thermal conductivity substrates will be minimal 
but if the source area is very small, as is typical with semiconductor 
sources, the increase in substrate thermal resistance will be most 
significant. Fig. 9 illustrates these effects for a 2.54 by 2.04 em (1.0 
by 1.0 inch) square substrate. 

A direct comparison can be made between alumina and beryllia 
substrates by comparing the results given in Fig. 10(a) thru (d) 
with those previously given in Fig. 3(a) thru (d). By choosing sev- 
eral points from the corresponding figures, one can quickly demonstrate 
the advantage of the high thermal conductivity substrate since the 
thermal conductivities differ by a factor of 6.7 if the same thickness 
and coefficient of convection values are used. Specific results giving 





10° 10! 10° 109 104 10° 10! 102 10> 104 
ab/AxAy ab/AxAy 


Fig. 10—Substrate thermal resistance: (a) and (c) a/e = b/c, At/a = Ay/b, 
xo/@ = Yo/b = 0.50, hic/k = hec/k = 0.14 X 10-3; (b) and (d) a/c = 2b/c, Ax/a = 
Ay/2b, zo/a = yo/b = 0.50, hic/k = hec/k = 0.14 X 107%. 
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such a direct comparison are given in Fig. 11(a) and (b). These fig- 
ures demonstrate that for very small sources or for substrates with an 
isothermal boundary on one edge, the beryllia substrate can make 
possible approximately a 60°C/watt reduction in the substrate thermal 
resistance. Since adequate power dissipation is one of the present prob- 
lems of thin film and integrated circuits, the use of high conductivity 
substrates should be considered. 

The final numerical results illustrate the thermal interaction be- 
tween sources which must be considered if some components have 
temperature sensitive parameters. A synchronous clock logic circuit 
is chosen for this example. One version of this circuit consists of four 
flat packages appliqued to an alumina substrate and inserted into a 
socket. The socket design is such that the edge of the substrate which 
makes contact closely approximates an isothermal boundary. Each 
flat package contains two or three integrated circuit chips and dis- 
sipates approximately one-fourth watt. To complete this analysis the 
only required additional computation is to calculate the temperatures 
at other source locations and superimpose these effects. Fig. 12 gives 
the temperature profile of the substrate based on a one-fourth watt 
power dissipation from each flat package. It should be emphasized 
that the temperatures given are referenced to a heat sink and ambient 
of zero degrees centigrade and that the flat package and integrated cir- 
cuit chip thermal resistances must be added to the values obtained for 





102 
ab/AxAy 





3 ‘ 
BeO 


Fig. 11—Substrate thermal resistance: (a) a/c = 2b/c, Av/a = Ay/2b, xo/a = 
yo/b = 0.50, hic/k = hec/k = 0.14 X 1073, c = 0.0635 cm, & = 0.202 watt/cm-°C; 
(b) a/e = b/c, Az/a = Ay/b, 2o/a = yo/b = 0.50, hic/k = hoc/k = 0.14 X 1073, 
c = 0.0635 cm, k = 0.202 watt/em-°C. 


COOLING OF SUBSTRATES 1677 


en @ ee ee 





Fig. 12— Temperature profile (°C) per watt for synchronous clock logic circuit: 
a/c = b/c = 80, Ar/a = Ay/b = 0.20, xo/a = 0.25 and 0.75, yo/b = 0.25 and 0.75, 
hic/k = hec/k = 0.63 X 1073, c = 0.0635 cm, k = 0.202 watt/cem-°C. 


the substrate to obtain the overall thermal resistance. A reasonable 
temperature rise for the flat package and integrated circuit chip is 
9.5°C so that the hottest temperature in the circuit should be no 
greater than 85°C if it is assumed that the maximum ambient tempera- 
ture is 65°C. Less than a 5°C difference in temperatures due to inter- 
action between the four flat packages is also expected. Thus, the 
temperature sensitive components of this circuit should track reason- 
ably well. This ability to predict interaction effects between sources 
is obviously important for circuits with temperature sensitive com- 
ponents. 


IV. CONCLUSIONS 


The three-dimensional thermal problem of convection from the two 
large faces of a substrate with heat conducting leads has been solved 
for a substrate with isothermal edges, insulated edges or combinations 
of these boundary conditions. Results from this solution show the 
effects of substrate area, shape, thickness and thermal conductivity. 
Included are the effects of source area and the coefficient of convection. 
Two of the more important conclusions concerning heat transfer char- 
acteristics of glass and ceramic substrates follow: 
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(‘) The coefficient of convection has a significant effect on the 
thermal resistance of substrates with insulated edges or high resistance 
leads and becomes the dominant parameter as the heat source area 
approaches the substrate area. 

(ti) The thermal conductivity of the substrate is the dominant 
parameter affecting the thermal resistance of a substrate containing 
small area heat sources which are typical of beam-leaded integrated 
circuits. 
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Characteristics of Superconductor Strip 
Transmission Lines with Periodic Load 


By J. K. HSIAO 


(Manuscript received June 9, 1967) 


The characteristic tmpedance and propagation constant of a thin film 
superconducting strip transmission line has been derived by use of London’s 
two fluid model. It 1s shown that this line at moderate frequency has neg- 
ligible attenuation and dispersion. A periodically loaded cross film cryotron 
circuit 1s also analyzed. The attenuation, phase constant, and characteristic 
umpedance of this loaded line is given and related to the parameters of 
the unloaded line by the factor K which is the ratio of the gate separation 
to the gate width, 


I. INTRODUCTION 


This paper presents a study of the high-frequency performance of 
thin film superconducting transmission circuits. Particular attention 
is given to transmission lines between cryotron elements and between 
substrates each carrying many cryotrons. 

These interconnections are microstrip lines with very low char- 
acteristic impedances. Since the separation between the transmitting 
strip and the ground plane is very small in comparison to the width 
of the strip, edge effects can be neglected. This simplifies the analysis 
and gives an easy understanding of the propagation phenomena. 


II. MICROSTRIP TRANSMISSION LINE WITH SUPERCONDUCTING STRIP 


Due to the finite conductivity of the strip and ground plane in a 
non-superconductive microstrip transmission line, the phase char- 
acteristic has some dispersion and the attenuation is frequency de- 
pendent.? If the strip and ground plane are superconducting, the phase 
dispersion and the attenuation will disappear at frequencies below 1 
GHz. This was shown by Swihart? using Maxwell’s and London’s 
equation. Using a very simple but not rigorous approach, the character- 
istic impedance and propagation constant are presented to give an 
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understanding of this type of transmission line. The inductance, 
capacitance, and conductance in the dielectric region are the same 
as for non-superconducting strip line and have the following values:? 








€ Ah 

as (1) 

pes e (2) 
2b 

g==, (3) 


provided b/h < 1, 
where 


l’ is the inductance per unit length, 

c is the capacitance per unit length, 

g is the conductance per unit length, 

b is the width of the strip, 

h is the distance between the strip and the ground plane, and 

Me, € , and oa, are, respectively, the permeability, permitivity, and 
conductance of the dielectric material between the strip and ground 
plane. 

The internal impedance of the conducting strip and ground plane 
of the superconducting line are different. These are found by the 
following manipulation. Assume that a superconducting strip of thick- 
ness x = d and infinite width y, has a current flowing in the z-direction 
as shown in Fig. 1. The superconducting strip is immersed in a uniform 
dielectric material. The current is uniformly distributed along the y 
direction. London’s equation that is based on a two fluid model? in- 
cludes the following relations: 


DIELECTRIC 
Oe = CONDUCTIVITY 
Jte = PERMEABILITY 
€e = PERMITIVITY 


-SUPERCONDUCTOR, 
Oc =RESIDUAL CONDUCTIVITY 
pic = PERMEABILITY 
€ ¢=PERMITIVITY 
Ac=PENETRATION DEPTH 





Fig. 1 — Current flowing in a superconductor strip. 
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J=Jj.+ J. (4) 
VxXXNJ, =H (5) 
ne Z OJ) = E, (6) 


where 


J, is the superconducting current, 

J, is the normal conducting current, 

J is the total current, 

H and E are, respectively, magnetic and electric fields, 
\, is the penetration depth of the superconductor, and 
u, is the permeability of the superconductor. 


Maxwell’s equations applicable to both the superconducting and 
dielectric regions are as follows: 


Vps7. paid (7) 

VEBs0) “Be pH (8) 
dH 

VA ESS (9) 

vxu=(®) +3. (10) 


These equations are based on MKS units. 

From these equations, in the superconductor region, replacing J 
by J, + J, and then using (6) and (7) through (10), we have a gen- 
eral expression of London’s equation. 


1 
de 
where go, is the residual normal conductivity. This is the conductivity 
measured just above critical temperature. H,, »,-, and «, are, respec- 
tively, the E-field, permeability, and permittivity in the superconductor 
region. 

The three terms on the right side of this equation are contributions 
of superconducting current, normal current, and the displacement 
current, respectively. In the superconductor region the displacement 
current term can be neglected. Hence, (11) becomes 


al dE. 
es dt 


aki 


2 dE, : c 
VE, = aE. + ote re + eMe WE (11) 


V’E. = 





E, + ote (12) 
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In the dielectric region, we have 


dE. dE, 
WE, = Oebte TE eee ap (13) 


Use the coordinates defined by Fig. 1 and assuming the fields to be 
sinusoidal with respect to time, 








@E, (1... \. 
re a (4 + jot), (14) 
aE, ; 
re (—w'ee + joou)H,, . (15) 


The solutions of the above equations are, respectively, 


E,, = A, cosh kx + ,A, sinh kyx Cae =a 





L,, = B exp (—k.2) C= 2. 
where k, and ke are defined as 
1 . a1 
k= . (1 + jwo.mede)? (16) 
ky = jwV ueee (1 _ j%.] . (17) 


In the superconducting region, we retain two solutions in order to 
match the boundary condition at x = d, while in dielectric region we 
retain only one solution because we assume that the dielectric ma- 
terial is uniform and extends to infinity, and there is no reflection 


wave. 
The H-fields in both regions are, respectively, 


jou H,, = —A,k, sinh ky — A.k, cosh kyx 
jou.H,, = Bkz exp (—kp2). 
At the boundary x = d 
Bye Beg Hyg = ges 
Hence, 


k,/k, smh k,d + cosh k,d 


AON ed Chk coh kd be ee OS) 
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At 
x=0 
B.S 
i. = ne A.k, e 
joe 


By use of Ampere’s Law the current inside the superconductor can 
be found. For unit width, it is 








15 OM Ps tie Oral Por 
= 3 J (= Aki + Asti sim fd + Ask, cosh Ind). (19) 
The internal impedance of a conductor is defined as* 
Zi= o ohm/m’, (20) 


where Ep is the surface E-field and I is the total current in the con- 
ductor for unit width. Hence, 


Zi = joe _k2/k, + coth kid (21) 


k, 1+ ko/k, tanh k,d/2- 
The classic skin-effect depth of a normal conductor is defined as* 
nan ae 
Varfucte 
From this and (16), for the superconducting region 
—_ A (1 i. aj Me). 


For several common superconducting materials, the classic skin-effect 
penetration depth 6 are listed as follows: 


Conductivity 
Transition’ at transition ry 6 
temperature temperature at f = 108 Hz at f = 10° Hz 
Lead 7.22°K 0.52 X10" mho/m 0.7 X10A 0.216 x 10!A 
Tin 3.74 0.896 * 10° 0.53 & 104 0.167 X 10 
Tantalum 4.38 0.806 X 109 1.77 X 105 0.56 X 104 
Indium 3.374-3.482 0.386 10° 2.65 X 10! 0.84 X 10! 


The pentration depth 2, is in the vicinity of thousand A (for lead, 
it is 500A, while for tin it is 1500 A). Hence, in general, at a frequency 
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10° Hz d,/6 = 1/10. This is assumed at a temperature 7’ which is 
at least 0.1°IK below transition temperature. Under this condition, 
for a good approximation 


ky =& - 
The o,/we, term in (17) is defined as the loss tangent of dielectric 
material. For most dielectric material at room temperature, it is ap- 
proximately 107° to 10°*.° For SiO, it is 10°? at 1500 Hz. There is 
no available data at helium temperature. However, for reasonable 

approximation we can say that 
Oe 


<1, 


WE, 


Hence, 





ke FS jw Wie. 
. 20 


ae oe 


where A, is the wave length in the dielectric. For SiO its relative permi- 
tivity is approximately 5.’ Hence, A, 74.5 cm at 10° Hz. 
The | kz |/| &, | ratio is then 


de = 
| ke /| hs la 10 - 


Under this assumption, (21) can be approximated as 
Z’ = jop,d, coth d/rA, ohm/m’. (22) 


Next let us assume that a superconducting strip transmission line 
is formed by two strips immersed in a dielectric material as shown in 
Fig. 2. For this line, its series impedance is the sum of the Z‘ and the 
inductance in the dielectric; hence, its series impedance and parallel 
admittance are, respectively 


Nes Ne 
h coth d,/d,, + ; 








Z = joy, : (1 + 2 coth da/Res) (28) 
fea 
Y = Jwee h (24) 


If the dielectric loss and classic skin-effect loss is not negligible, then 
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SUPERCONDUCTOR STRIP 
—--—— oe, ber Oy 
REGION 1 


SUPERCONDUCTOR 
____ GROUND PLANE 
Ace »Bhe> To 
REGION 2 





Fig. 2— Superconductor Strip Transmission Line. 


2\-4 
ee. ; E a nes (1 + 9j Xe) Bot ed 











XG. A2\3 
a a 1+ 2) 52 coth kod (23a) 
2 
. Ob an Fs 
ae (1 med z..). (24a) 


' The characteristic impedance and propagation constant of this line 
can be found by use of the following relations: 


Z.= VZ/y 
ye = VZy. 
Hence, by use of (23) and (24), we get 
Fix fet (1 pee a ane 
e, b h 


Ve = JoV Meee (1 + nes coth d,/\., + wa coth d./.) 


t 


I 


1 
2 


* coth da). (25) 





de 
h 








ey 
2 
. 


(26) 


Set y. = j8.. 
Then the phase velocity of this strip line is 


BS 


2 
m/sec. 


r 


u P 
h 


V bee 


And its delay time is 


Nes 
h 











(1 + = coth d,/\.. + coth da/d..) 


BS 


T= Vie (1 + nes coth di/d., + nes coth da/d.,) sec/m. 
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While by use of (23a) and (24a) we get 








zy = [Heh 
Z.= 5 
E + . (1 + 2r2,/6;)7? coth kid + tes (1 + 2jN2,/ 82)? coth ta | 





(1 — jo./we,)? 
(25a) 


Ye = joV wee. (L — jo,fon}| 1 + wee (1 + 2jr2,/8:)? coth kd 





+ ne (1 + 2jr2,/82)~? coth tal | . (26a) 

At frequencies below 1 GHz with the temperature at least 0.1°K 
below transition temperature, (25) and (26) give fairly accurate re- 
sults, providing the dielectric loss is negligible. Then the characteristic 
impedance is a real number with negligible frequency dependence. 
The propagation constant is directly proportional to frequency; hence, 
its group velocity and phase velocity are the same and there is no 
attenuation. 

Fig. 3 shows the characteristic impedance and delay time of some 
superconducting strip lines with various dielectric thickness (h). 

It is shown that the thickness (d;) of the strip line film has little 
effect on the characteristic impedance and delay time. However, the 
characteristic impedance changes proportionally with dielectric thick- 
ness (h) while the delay time 7, decreases nonlinearly by only 20 per- 
cent for an order of magnitude change in h. 


Ill. A SUPERCONDUCTING STRIP LINE WITH PERIODIC STRUCTURE 


The cross film cryotron consists of a control strip and a gate strip 
crossing and perpendicular to each other. In a memory circuit or in a 
tree-type selective circuit, a single control strip usually crosses many 
gate strips. At each intersection there exists coupling between the 
control and gate strips to form a periodic structure. The characteristic 
impedance and propagation constant of the control line are functions 
of the periodic loading. (Refer to Fig. 4.) 

The control line and gate line are assumed to be terminated with 
their respective characteristic impedances Z, and Z,. The control line 
is also assumed to be uniform without discontinuity except for the 
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Vig. 3— Relation between characteristic impedance delay time, and dielec- 


tric thickness for superconducting thin film strip line. 


periodic loading of the couplings to the gates. The equivalent circuit is 


shown in Fig. 5. 


In Fig. 5, y, is the propagation constant of the control line, Y, is the 
coupling admittance between gate and control line. The characteristic 
impedance and propagation constant of this periodically-loaded line is 


as follows (see Appendix A): 


YZ, el 
fo = ar ~ ginh 2y,l + YZ, cosh” —| 
vo = cosh™ [cosh 2y,1 + YZ,/2 sinh 2y,]], 


where 


ee eee me 
Y — 1 =f 4Y.Z, l ~~ 3(d, + W,,). 


(28) 
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a aaa.. 


h 






—-—-GROUND PLANE 
Fig. 4— Periodic gate crossing of a cryotron circuit. 


If the ratio of the control line width b to the distance h, between 
gate and controls lines are large (b/h) > 10’ this capacitance to a very 
good approximation is® 

Cae, (298) 
h 
where A is the intersection area of the control and gate lines, and e, 
is the permittivity of the insulation material. 

Since the magnetic fields in control and gate lines is assumed 

orthogonal to each other; therefore, there is no magnetic coupling, and 


Y, = pw. 


If we want to take into account of the dielectric loss of the insula- 
tion material, Y, becomes 





ae (1 ~ j%-) : ———— (29b) 

h WE, 

where o, is the conductivity of the insulation material. For Si0, o,./we, 

is approximately 10-°.7 Hence, this term can be neglected in this case. 
By use of this result, we find 


eke SOs = 3p 
1+ 390CSZ, 
For a typical cryotron, the width of the gate is approximately 20 


milli-inches and the width of control line is 5 milli-inches. The insula- 
tion material SiO has a relative permittivity of 5. If h’ is 5000 A, then 


Y (30) 
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es a6-<j p20 XB 254 X10" 
C= 3. X 10° X 5 Biot 


Cx 5.7 X 10°" farads. 





If the gate is terminated by its characteristic impedance, then Z, 
is approximately one ohm for an ordinary cryotron. Hence, 


w0Z, = 6 X 10° 

at a frequency of 100 MHz. Therefore, (30) can be simplified as 
Y & joC(1 — 3jwCZ,). (31) 
We have shown in Section II that the propagation constant of a 
‘ superconducting strip line is an imaginary number; therefore, we set 

Ye = JB. - 
Hence, 
a wZ (1 — 4joCZ,) i 

Aer Z|. sin OF CZ. = uz) cos Bl | OY 


Tor typical cryotron circuits, the spacing between gates (d,) is 
about equal to the gate width (W,) for maximum compactness. At a 
frequency of 10° Hz, 


B,l ~~ 107° radians. 


Hence, 


sin 2 6,1 ~ 2 B.l 


cos’ 2 B.l & 1. 





Fig. 5 — Equivalent circuit of a periodic loaded cryotron. 
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Neglecting the (wC)’Z,Z, term, we find 


1 5 
sae eee 1k 
wCZ, 


By use of (25), (26), and (29a), we find 
28.1 _ h’(d, + W,). 





wtZ, hW, 
If h’ = h, then 
= W, + d, ) 
a PACE + d, 
Setting 
2 RW. 
e Sgeiee Ky 
iy = Z(H) 


(33) 


(34) 


(35a) 


(35b) 


For maximum package density, the gates are placed as close as 
possible, and thus, d, and consequently K are made as small as pos- 
sible. However, to avoid interference between adjacent gates, it is 
usual to set the distance between gates at least equal to their width. 
For this condition, Z, = 0.815 Z,. As K becomes larger, Z) approaches 


Z, in value. 


In the next step, the propagation constant of the periodic-loaded 


line is determined. 
First substituting the following condition in (28): 


Ye = JB. 
sinh 2y,1 } 276,1 


cosh 2y,l 1 — Cee. 


and set 
Yo = A + Bo . 


Equation (28) can be rewritten as 


cosh (@ + 78) = 1 — oe 


— wOZ.B.l + jf8C)YZ.Z Bl. 
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It is noticed that the real part on the right side of this equation 
is very close to but less than 1. The imaginary part is very small. 
Hence, we conclude that a and 8 must be a small quantity. The 
real and imaginary parts of this equation become, respectively, 


I 


2 
cosh ay cos By = 1 — as — wlZ.B,l (36a) 


sinh ay sin By = 4(@C)’Z,Z,B,1. (36b) 


Using the approximate relationship 
2 
cosh a Y1 + = 





cos Bp 1 — & 
sinh ay LY a 
sin Bo & Bo 
Li POL; 
and defining the following constants [see (34)]: 
oa sar a =R 4 (37) 
26.1 = 6, , 


where @, is the phase shift between gate crossings along the control 
line without loading, (86a) and (36b) can be rewritten, respectively, as 


QoBo = 1R°8 . 
Neglecting the a383 term, we find 
Bo — % = + REG; 
QoBo = TRO: . 
Hence, 


Bo = a 0c + m1 4 (1 = ier) It. 


1(R’9,.)” K (1 + R)’, 


Since 
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GB, becomes 





a | , 1 Re : 
Further neglecting R*62/(1 + R)’, then we obtain 
Bo = 6(1 + R)? 


= 19 ae 
Ho = 38. ‘+ RP 


Replacing R by (387), we obtain 


_ ,{d, + 2W,\' 

nee a (% s pe) (398) 
Sait sts VG ak ae, 

ao = 48 Gaia, + aw)? (39b) 


Using the relation d = KW,, then 





2 eee xy = 
Bo = . 1K) = phase constant (40a) 
Qo = Be = attenuation constant. (40b) 


"K+ DK + 2 


Equations (40a) and (40b) are phase constant and attenuation per 
period.* If there are n gates crossing in one meter length.of control 
line and if they are equally spaced, then the phase constant and attenua- 
tion per meter is 


B = B.(d, + wn(2+ % ok xy rad/mn 


: x neper/m. 


ad, + We)” GIG + wD! 


R 
I 


Replacing 8, by 7, and realizing that n = 1/d, + W,, we find 


ee rool pt EY ridimna/m (41a) 


a = iw W, CE TE ee (41b) 


*One period is the distance (d, + W,) between cryotron gate crossing of 
the strip line. 
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The delay time per meter of the loaded line is 


— (2+ Ky 
T= (2+ 1K sec/m. (42) 

For a typical cryotron having control width 0.005 inch and gate 
width W, = 0.02 inch, SiO as dielectric of a thickness 5000 A, and 
kK = 1, then its attenuation at 10° Hz becomes 


2.56 X 107° neper/m 


I 


a 


or 


lI 


2.12 X 10°? dB/m. 


With this configuration, on a 3” X 3” substrate from one side to 
the other side of the substrate, 75 cryotron can be laid down. The 
attenuation will be only about 5 X 10-* dB. This is extremely small. 
For larger K, this attenuation will still be less. Hence, for our purpose, 
it can be neglected. The delay time for this particular example is 
approximately 10 nanosecond per meter. For the same substrate, the 
time for a pulse to travel from one side to the other side of the substrate 
is approximately one nanosecond. 

Fig. 6 shows the relation between characteristic impedance Zp, 
delay time 7) and the ratio (K) of gate separation (d,) to gate width 
(W,). From Fig. 6 it is shown that the characteristic impedance Z, 
of loaded line is less than the characteristic impedance Z, of an un- 
loaded line at smaller K (at K = 1, Z ~% 0.82 Z,). As K becomes 
larger, (K > 10) Z./Z, ratio approaches unity. The delay time 7, 
per meter of a loaded line is larger than the delay time of an unloaded 
line (at K = 1 7) = 1.22 7,). However, their ratio also approaches 
unity as K becomes larger. 


IV. CONCLUSION 


Section IT derives the characteristic impedance and propagation con- 
stant for typical thin film superconducting strip lines used for inter- 
connecting cryotron elements. The characteristic impedance is shown 
to be a real number with negligible frequency dependence. The propaga- 
tion constant is shown to be directly proportional to frequency and 
hence the group and phase velocity is identical. It is found that the 
transmission performance of these lines are, for practical purposes, 
independent of film thickness when in excess of 500 A. Fig. 3 shows 
the characteristic impedance (Z,) and delay time r, for a film strip 
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Fig. 6— Relation between characteristic impedance (Zo) delay (_ ), and 
ratio (k) of gates separation (d) and gate width (W,) of superconducting strip 
line with periodic gate crossing. 


of 5 milli-inches width on SiO dielectric. For other widths, Z, is in- 
versely proportional to width and 7, is independent of width. For 
example, at a dielectric thickness of 5000 A and film width of 5 milli- 
inches, Z, = 0.725 ohms and +r, = 8.2 X 107° sec/meter. 

Section III derives the characteristic impedance (Z,.) and propagation 
constant for thin film superconducting strip line used as a common 
control which crosses a series of cryotron gates periodically spaced. 
This is related to the transmission characteristics (Z., 7.) for the 
strip line if they did not cross cryotron gates. Fig. 6 shows this relation 
with plots of characteristic impedance and delay time per meter versus 
the ratio (K) of distance between gate crossing with gate width. For 
high packing density K = 1, it is found that the characteristic impedance 
is reduced almost 20 percent due to the loading of the gate crossings. 
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The delay time 7, is increased 20 percent. The attenuation constant 
(ao) is extremely small, hence it can be neglected for practical purposes. 
APPENDIX A 


Referring to Fig. 5 from point 1-1’ to point 2-2’ the control line 
has characteristic impedance Z, and propagation constant y,. The 
voltage and current equations in terms of the Z, and y, are’’*°""?"” 


V; coshy,.l Z,sinhy,l| | Vo 
= ila ‘ (48) 
I, z. sinh y.l cosh y,l Ts 
where 
l= 3(d, + W,). 


From point 2-2’ to point 3-3’, the voltage and current relations are 




















7 
V3 = 1 0 J 3 (44) 
I; Yd I; ; 
where 
y Y 


 14+4Y.2Z, 

If the gate lines are terminated in their characteristic impedance 
then Z, is the characteristic impedance and a real number independent 
of frequency. Otherwise, Z, might be a complex and be frequency 
dependent. 

Similarly, from point 3-3’ to 4—4’ we have 


Y; coshy,l 2Z,sinhy,l|} V, 
=|, . (45) 
1B Z. sinh y,l coshy,l LT; 
Hence, for each period, we have 
V,{|_|A B | | Vi 
ie CoD ay 














where 


pa 
| 


= cosh 2y,1 + YZ,/2 sinh 2y,l, 
A=D 


? 
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and 
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B = Z, sinh 2y,l + YZ? sinh’ y,l, 


C = = sinh 2y.1 + Y cosh’ y,l. 


id. 
Ze 


Since A = D this is a symmetrical circuit, accordingly its charac- 
teristic impedance Z,) and propagation constant Yo are 


Zo = VB/C 


(46) 


y E 7 YZ, } 
i sinh 2y,l + YZ, cosh’ y.l 


eo ~1 
Yo = cosh” A (47) 


I 


cosh™* [cosh 27,1 + YZ,/2 sinh 2y,l]. 


LIST OF SYMBOLS USED 


Ll’ 
C 


g 
W dsb, h, d, 


Me 


= Inductance 

= Capacitance 

= Conductance 

= Geometric Parameters 
Permeability of dielectric material 
Permittivity of dielectric material 
Conductivity of dielectric material 
Permeability of superconductor 
Permittivity of superconductor 
Residual normal conductivity of superconductor 
Current Density 

Electric Field 

= Magnetic Field 

= Angular velocity 

Penetration depth of superconductor 
Classic skin-effect depth 
Impedance 

Characteristic _ Impedance 
Progapation constant 

= Phase constant 

= Wave length 

Delay time 


I] 


ll 


ll 


I 


I 


I 


I 


I 


I 


I 


I 


I 
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Scattering Relations in Lossless Varactor 
Frequency Multipliers 


By C. DRAGONE and V. K. PRABHU 
(Manuscript received May 22, 1967) 


In recent years, the use of varactor diodes for harmonic generation has 
become increasingly widespread. Varactor harmonic generators come under 
the general class of pumped nonlinear systems, which are networks driven 
periodically by a pump or a local oscillator at a frequency w» and zts har- 
monics. For such systems, a general method has been presented in this 
paper to obtain the scattering parameters which relate the small-signal 
fluctuations present at various points in the system. In particular, the 
scattering parameters of lossless abrupt-junction varactor harmonic gen- 
erators of order 2”, 3°, and 2"3° with minimum number of idlers have 
been obtained. It has been shown for these multipliers that there 1s no 
amplitude-to-phase or phase-to-amplitude conversion if fluctuations are 
in the vicinity of the carriers. With minor modifications this theory can 
be extended to the study of lossy varactor harmonic generators. 


I. INTRODUCTION 


The carrier voltages and currents present in a varactor frequency 
multiplier are perturbed by small amplitude and phase fluctuations 
due to a variety of causes, such as noise, synchronizing signals, etc. 
In some applications, these fluctuations may be due to modulations 
purposely applied to the carriers. An example of such applications is 
that in which a frequency modulated signal is multiplied in frequency 
to increase its modulation index. It is the purpose of this paper to 
study how these perturbations propagate in the circuit of a multiplier. 
In other words, this paper considers the problem of determining the 
small-signal behavior—a problem which is of basic importance in un- 
derstanding the problem of stability and noise performance in high 
efficiency varactor multipliers. 

+See Ref. 1. The problem of stability is also treated in a subsequent paper.? 


Part of the results obtained in this paper represent generalizations of some of 
the results presented in Refs. 1, 3, and 4. 
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In the earlier part of this paper, a general method has been presented 
in order to obtain the scattering parameters of pumped nonlinear sys- 
tems which are networks driven periodically by a pump or a local 
oscillator at a frequency{ w9 and its harmonics. Harmonic generators 
discussed in this paper come under this class of systems. Some of the 
formalisms usually used to describe the fluctuations in these systems 
are also briefly reviewed. 

In the second part of this paper we discuss varactor multipliers in 
which the diode is not overdriven and is of the abrupt-junction type. 
The equivalent circuit of this type of multiplier consists of an ordinary 
linear, passive, and time-invariant circuit connected to the time-vary- 
ing component of the elastance S(t) of the varactor.® In general, it is 
shown that a complete solution of the small-signal behavior of such 
a circuit requires that S(t) be known. On the other hand, it is well 
known that certain properties of the small-signal behavior of a multi- 
plier do not depend at all on the particular form of S(t). For instance, 
a general and well-known property of a multiplier of order N is that 
slow fluctuations in the phase of the input drive produce N times as 
large fluctuations in the phase of the output signal. One of the main 
results of this paper is that, under certain general conditions, many 
other properties of the multiplier are related in a simple way only 
to the order of multiplication N. All the small-signal characteristics of 
a multiplier that are of practical interest can, therefore, be readily 
determined without having to calculate S(t). 

Specifically, we consider a lossless multiplier of order N = 2°3° = 
2, 3, 4 ete., which is tuned at all carrier frequencies§ and has the least 
number of idlers. Then, if the various small-signal fluctuations of such 
a multiplier are properly normalized with respect to the corresponding 
carriers, one finds that the small-signal terminal behavior of the elastance 
S(é) is completely determined by N only. It is important to point out 
that this is exactly true only for w < w» , where w is the frequency of the 
fluctuations and wp» is the carrier frequency of the drive. If this in- 
equality is not satisfied, then the small-signal behavior will also depend 
on w. A consequence of these results is that the AM and PM scattering 
parameters of the multiplier of order N = 2”3° considered in this paper 
only depend on 7 and s, in the vicinity of the carriers. They are given, 
respectively, by the two matrices 

[In this paper the word frequency has been used exclusively for the angular 
frequency of a sinusoidal signal. If f is the frequency of a signal in Hz its angu- 
lar frequency w is given by w = 2zf in radians/second. 


§ Tuning of idlers, and input, and output circuits usually gives near optimum 
efficiency 2:78 
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It is important to point out that for the above multiplier it has been 
assumed in deriving the results that the bias circuit is properly designed 
so that there are no low-frequency fluctuations of the average capaci- 
tance of the varactor diode.° This assumption leads to the result that 
there is no amplitude-to-phase and phase-to-amplitude conversion if 
w/a KI. 

Several other results are also presented in this paper. For instance, 
it is shown that, if the number of idlers is minimum, then an abrupt- 
junction varactor multiplier of order N = Nyx N2X...xX Ny is 
equivalent to a cascade of n multipliers of order N,, No,..., Nn. If 
the varactor is not overdriven, this property furnishes the basic equiv- 
alent circuit for studying the properties of most of the higher-order 
multipliers encountered in practice (NV = 4, 6, 8, etc.). 

Finally, it is important to point out that techniques presented in 
this paper are applicable to the derivation of scattering parameters 
of multipliers, of any order, with any arbitrary configuration of idlers, 
and using a varactor diode having arbitrary capacitance variation and 
drive level. We only assume that the elastance S(t) of the diode used 
in the multiplier has a Fourier series. 


II, SOME CONSIDERATIONS OF PERIODICALLY DRIVEN NONLINEAR SYSTEMS 


As mentioned earlier in this paper, frequency multipliers come under 
the general class of nonlinear systems driven by a strong periodic car- 
rier. It is our interest to study in this paper how small perturbations 
on the periodic driving of such systems are propagated, and to this 
end we shall give a brief introduction§ of a circuit theory which enables 
us to relate the perturbations at different parts of the system. The 
perturbations or fluctuations that we would like to analyze could be 
caused by desired or undesired modulation, noise, hum, or synchroniz- 
ing signals. The origin of these sources of fluctuations is not relevant 
to our development of this theory. 

Let us consider a nonlinear system. It is our assumption that the 

t The conditions under which a periodic time function z(t) has a Fourier series 
are well-known; and can be found in any book on Fourier series. See, for ex- 


ample, Ref. 9. : 
§ See Ref. 10 for a more detailed account of this theory. 
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large-signal voltages and currents at various parts within the system 
are, by design, periodic with some frequency wo. Thus, the voltage at 
some specific point within the network or across one of its terminal 
pairs, v(t), is of the form 


ie bs Vises Gia: (1) 


where the V,’s are half-amplitudef Fourier coefficients, with V_, = 
V*. However, the actual voltage v(t) may deviate from (1) because of 
fluctuations present in the system. Thus, 


fae] 


v(t) = > Vi exp (jkwot) + dv(2), (2) 


where 6v(t) is small compared to v(é) in (1). The circuit theory that we 
shall use in the rest of this paper is one which describes perturbations 
dv(t) and relates them to similar perturbations of voltages and currents 
in other parts of the system. The perturbations are assumed to be small 
and they are at frequencies close to the carriers.|| 

The carrier voltage at some particular point in the system is of the 
form 


V, exp (jkwot) + Vi exp (—jkwol), (3) 


where V; has some phase angle ¢,,. The actual voltage v,(#) in the 
vicinity of this carrier deviates from (3) because of the perturbation 
5u,(t); 


v(t) = V, exp (jkaot) + VE exp (—jkwot) + 6v,(Z). (4) 


Similar expressions can be written for currents and voltages at various 
places in the network. The various voltages like v(t) obey Kirchhoff’s 
voltage law, and various currents 7(t) defined at various points in the 
network obey Kirchhoff’s current law. Furthermore, the carrier voltages 
and currents at various points in the network obey these Kirchhoff’s 
laws, leading us to conclude that the perturbations like dv(¢) and 6z(t) 
also obey them. 

Let us now assume that the perturbation 6v,(¢) contain frequencies 
that are located in a band of width 2w, centered about frequency kw» 
where 20, < w).'° We can write’ »,(t) ast 

| Note the use of half-amplitudes, rather than amplitudes or rms values. 

| The large signal voltage or current present in the system at frequency +kw 
will be referred to from hereon as the carrier voltage or current at that frequency. 
In frequency multipliers carriers are at different frequencies at different parts of 


the system. 
+ In writing (6), it is assumed that | vp.(é) |/| Vi | « 1 for all ¢. 
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v,(t) = 2 Re [| V, | + var(t) — Wo(t)] exp [j(kwot + ¢.x)] (5) 





mw 2Re ll Veltri ow {i kat ten- 22], 


where v,;(t) and v,,(t) are slowly varying functions of time. The voltage 
v.,(t) can be interpreted, since it is small, as a perturbation on the 
amplitude | V, | of the carrier. Similarly, voltage v,,(£), because of (6), 
can be interpreted as a perturbation on the phase kwot + ¢,, of the 
carrier. We shall refer to v,,(t) as amplitude (AM) fluctuations and 
to v,,(t) as phase (PM) fluctuations. Similar AM and PM fluctuations 
can be defined at various points in the system. 
If these AM and PM fluctuations are sinusoidal, we have§ 
Va(t) = Vax exp (jot) + V8, exp (—jol) (7) 
and 
Vpe(E) _ Vox exp (jot) ar Vor exp (— jot). (8) 
The actual voltage v,(#) is then given by 
v(t) = 2 Re [| Vi | + (Var — jV ox) exp (jot) 
+ (Vi — jV%) exp (—jut)] exp [j(kaot + 2)] (9) 
= 2 Re {V, exp (jkwol) 
+ Vax exp [j(kwo + w)t] + Ve, exp [j(—kwo + w)t]}, (10) 


where V.., Ve., Vaz, and V,, are related. The relation is 


i cae 
, at = aa Vo] ) (11) 
Vow V ex 
where the matrix },, is a function of only the carrier phase angle ¢,, . 
The matrix },, can be represented as 


ae os ae 1 \ (— Jeon) 0 | (12) 
p+ y 0 EXP (J¥on) 


Equation (10) shows explicitly the three frequencies kw), kwo + ao, 
and —kw) + w. The two sidebands here are both higher in frequency 
than kwy , and —kw , respectively, and therefore, these representations 
are referred to as upper sideband (a — £8) representations. We will 
use them along with the representations of the form (9) in the rest 


§ Since the fluctuations va.(t) and v,.(t) are band limited around dc, w must 
be less than w- in magnitude, where 2w. < wo. 
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of this work. Their mutual relation is given in (11). Because of (10) 
we shall refer to w as the fluctuation difference frequency. 

Let us now consider a pumped nonlinear system exchanging power 
at the carrier frequencies -kwo , +2wo, --- , nw. A nonlinear system 
exchanging power at a number of frequencies can be considered as a 
multiport multifrequency system as shown in Fig. 1. In Fig. 1 the 
system exchanges power at n carrier frequencies and it is assumed, 
without loss of generality, that no two ports exchange power at the 
same carrier frequency. Let the perturbation voltage and current 
at port k be denoted by 6v,(¢) and 67,(t), respectively. Since 6v’s are 
small, they must be linearly related.{ Hence, there is a relation which 





Fig. 1— Pumped nonlinear system exchanging power at n carrier frequencies. 


relates 6v, to 62’s of the form 


an) = Of halt, 2) 0i,(t — 2) dr, (13) 
j=1 J~0 
where h,;(t, 7)’s are functions of time ¢, as well as of time difference 7. 
Since the driving is periodic, if 62’s were applied one period later, 
6v, would be the same, except that it would be delayed by one period. 
This argument leads to the conclusion that h,;(é, 7)’s are periodic 
functions in ¢, with period 7’, = 27/w) and can be expressed in a Fourier 
series of the form 


+ This is because only first-order terms in 6v’s and 8i’s are retained. Higher- 
order terms are assumed to be negligible even when first-order terms vanish. 
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hy;(t, 7) = p> (hij)i(r) exp (jlaot), (14) 
where (h,;);(7) is a function of 7. Upon substituting (14) in (18), we 
find 


wi) = = p3) Gd) / ; GNOROa ie. GS) 


If 6v,(¢) is represented in the a — 8 form, 


6v,(t) = 2 Re {Vaz exp [j(kwo + w)t] + Va, exp [j7(—kwo + w)t]}, (16) 
we find{ 


Po = a Z ak pi Le Z akak Z akBk aes: Aeron a (17) 
V pk Z peat Z expr aad Z pk ak Z supe meee Z eran Z pk Bn 


where§ 


DSi ta yee pd ee gla go el wa ys (18) 


Ill. SMALL-SIGNAL ANALYSIS OF PUMPED NONLINEAR SYSTEMS 


For the nonlinear systems that we shall consider in this paper, 
we shall assume that the total voltage v(t) across the nonlinear element 
is related to the current 7(t) through it by the equation] 


v(t) = Fi}, (19) 


where F{7(t)} is a single-valued functional of 7(¢). 

Assuming that there are carrier currents flowing in the system at 
frequencies -iw) , 0 < 7 S n, the spot frequency terminal behavior of 
this system at a difference frequency w is given according to (17) by an 
equation of the form]|| 


t Essentially, we are discussing impedance formalism here which relates volt- 
ages to currents through an impedance matrix. Several other kinds of formalisms 
like scattering matrix representation or chain matrix representation can also be 
used to relate other desired sets of variables. 

§ A column matrix a is written in the form {a, a2, ..., an}, the curly braces 
being used to identify it as a column matrix. 

{lf there are any physical sources of fluctuations (such as noise sources) in 
the pumped nonlinear system, (20) is to be suitably modified. For the discussion 
of as oe in which noise sources may be present in the pumped nonlinear system, 
see Ref. 11. 

1! It must be pointed out that this equation only relates the small-signal fluctu- 
ations present in the system and not the carrier voltages and currents. 
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r 
V ao Z aad Z dai Z opi Peron ZL avai Z opi iain A eee Z aopn Ino 
Var Zerad Zerat Z aipi moos Lariat Zaipi Sense Lint ee Z eipn Tas 
Var ZB100 Zaiai Zeigi Fates Zpiai Zpipi eae Zeran Z ign Iai 


Vai 7 Zaiad Z vial Z wifi sae AMEE Z a ipi iE atei Latin Z aipn Lai ’ (20) 
Ve: ZBiad Z pict Zpisr os 55 LZ piai Z pipi eS Zeian Zpipn I; 


Van Zanad Leen Z anBi aivacg ee Z ani vars Ls, Z eanBn ‘ee 
L Vien Zbnao Zana Zang ces Zanei Z pngi east Lani Z Bn Bn I, 


where V,; and Vz; are the terminal voltages at frequencies jwo + w 
and —jw,o + w, respectively; and I,; and J,; are the corresponding 
terminal currents. We would like to note here that Vio = Vago is the 
small-signal terminal voltage at the frequency w. We shall, for brevity, 
write (20) as 





(Vi-@)n = (Za~p)n(La-p)n . (21) 

Let us now specifically consider a varactor diode which is pumped 
at a frequency w, and its harmonics. The varactor model that we shall 
use is shown in Fig. 2. It is a variable capacitance in series with a 


S(t) Rs 


Fig. 2— Varactor model. 


constant resistance R,.{ The instantaneous varactor voltage v(t) can 
be written as some function f of the charge, plus the drop across the 
series resistance R, : 


v(t) = fla] + RaQ), (22) 


where 
Hire [ i at. (23) 


For such a varactor, we can make use of the small-signal equations 
given in Ref. 5 in order to obtain the impedance matrix Z,_, in (21). 
If the elastance S(t) of the varactor diode can be written in a Fourier 
series of the form 


Si) = s S;, exp (jkwot), (24) 


+ Mainly we shall be concerned with varactor diodes which are lossless in the 
succeeding sections of this paper. For a lossless varactor diode R, = 0 


the matrix (Z.-s), can be represented as§ 


So 
Jw 


Si 
jo 


* 


jazi ls 


Jw 





(Za) n = 





Sr 
Jw 
Ci 


L jw 





























S¥ S, 
lwo + w) j(—w + ») 
So S» 
J(@o at w) j(— wo Ss w) 
SF ese eee 
Jo +) j(—w. + ») 
— Siva Seni 
(wo a w) J(—@o + w) 
Shea Shi 
J(wWo + w) j( ao + w) 
Suet Davi 
jlo +) j(—ao + w) 
ee Dat 
jlo + &) (wo + w) 


S? 
j(two + w) 
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Shea 
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So 
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SH 
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Ore 
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Sint 





j(— 109 + w) 


Sezi 


j( te oe w) 


Soi 
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So 
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Se 


j(—tw + w) 


§ We have put &, = 0 in order to obtain (25). If Rs 4 0, (Za-g)n = {(Za-g)n 


in (25)} + Relonyi. 1, is the unit matrix of order n. 


Ny 
(nay + w) 
Sani 
jw + w) 
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(ne a w) 
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Sri 


(nwo =P w) 
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J(nwo + w) 
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j(nay + w) 








Die 3 
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j(—Nw + w) 
Snot 
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Equation (25) shows that the impedance matrix (Z.-,), always exists 
for a pumped varactor diode as long as the elastance S(é) is expressible 
in the form (24). 

Let us now assume that the input carrier frequency is lw. ,1S 1S n; 
and that the output carrier frequency is sw, 1 S s S n (see Fig. 3). 
Let us also assume that the terminal constraints at other carrier fre- 
quencies are such that 


VS —Zooel, (26) 


where V’ is an a — @ terminal voltage column matrix given by 


V ao 
Var 
Ves 


V acti) 
V pai) 
V acti) 


Vy’ = Macon i (27) 


Visa’ 
Vpcs—1) 
V atest) 
V gc+1) 


- 

V bn | 
I’ is the corresponding terminal current column matrix. Z’,-» is the 
impedance matrix determined by the terminal constraints imposed by 
the external circuits on the system. These terminal constraints at all 
carrier frequencies excluding lw) and sw) are assumed to be known. 


Even though the currents flowing in the varactor are not limited by 
the diode in the range of available frequencies, it is assumed that the 
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external circuits are such as to offer an infinite impedance at any fre- 
quency very far from the carrier frequencies present in the multiplier. 
This enables us to consider the multiplier as a finite port multifrequency 
system. 

It may now be seen that by using (25) and (26) we can obtain a 
relation between V.,, Var, Lar, [g1, Vae; Vee, Las, and Is,. In 





Fig. 3— Small-signal terminal behavior of pumped nonlinear twoport. 


particular we can writet 


vr / vr tr 
Vat Leia VAT alas Lips Lat 


Ver} _ | Zptar Zhis: Zhtas Zotac || Lar (28) 
Ve Gent Laat 2 ate 2 aaiel| dag) * 
Via, Zbiar Zeesr Zosas LZhebet ites 
or 
Vexsie = (ZaspiaeGespiae (29) 


Equation (29) relates the small-signal fluctuations existing at input 
and output terminals of a pumped varactor diode. In case one is in- 
terested in relating the AM and PM fluctuations at the input and output 
terminals of a pumped varactor diode, we make use of (11). If 9,,, 
or, Git, and y;, are the phase angles of carrier voltages and currents 
at the input and output of a pumped varactor diode we get the fol- 
lowing equation which relates the different fluctuations: 


t In certain cases it is possible that the matrix (Z,-s):_. does not exist. Even 
though (Z,-s)1_s may not exist, in most cases of practical interest, we can always find 
a relation between the terminal voltages and currents at the sideband frequencies 
in the vicinity of input and output carriers. This will be shown to be true in the case 
of a tripler which is discussed elsewhere in this paper. However, the matrix (Za-g)n 
always exists for a pumped varactor diode. 
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Va Lai 
Voi ie Seal #86 |) (30) 
Vis es, 
Vex 1 
where 
(Zacpdins = Odi Zea iszAGizal 5 (31) 
1 
Oee= ae a (32) 
O 1 Avs 
and 
| 
Q)i-. = bee oe (33) 
(sae oe 


The matrices )’s are given as in (12). 

Once we have obtained the impedance matrix representation for the 
pumped varactor diode other kinds of representations like scattering 
matrix representation or chain matrix representation could be derived 
for any specific application or convenience. Mutual relations between 
these representations are given in Refs. 11 and 12, and we shall not 
discuss them in this paper. Scattering matrix representation of loss- 
less abrupt-junction varactor multipliers is extensively treated in 
later sections of this paper. 


IV. SCATTERING PARAMETERS FOR PUMPED NONLINEAR SYSTEMS 


The total voltage v(t) in the vicinity of a carrier at frequency -kwo 
can be represented as in (5) or (6). v,,(t) can be interpreted as a small 
perturbation on the amplitude | V, | of the carrier, and v,,(¢) as a per- 
turbation on the phase kw ot + ¢,, of the carrier. Since the device acts 
as a time-variant linear device to the fluctuations and since super- 
position holds, v,,(é) and v,,(é) can be represented as in (7) and (8). 

The voltage AM and PM modulation indexes at the carrier frequency 
kw) may, therefore, be defined as 





a Var 
Mok i | Vi | (34) 
and 
V ok 





6,4, = | = (35) 
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The AM and PM indexes at the input and output of a pumped non- 
linear system are, according to (30), related by an equation of the form 





[ my: Mit 

0 0; 
= A ae ’ (36) 
Mays Mis 
L Oi 6;. 
where 
-1 
| Vi | 0 
V -1 
(Znx~ixe = | ; | : (Dilixg 
: LAN 
La 
| I, | 0 
| Ii | 


(37) 


, ra 
ra 


It is assumed that carrier voltages at frequencies lw) and sw are nonzero. 
~ Until now we have exclusively used the impedance formalism to 
describe the properties of the pumped nonlinear system at the side- 
band frequencies. The choice of an appropriate formalism is particularly 
important in theoretical studies where important properties of the 
system may be obscured by complicated equations. The scattering 
parameters of a system are a set of quantities which can describe the 
performance of the system under any specified terminating conditions, 
just as the impedance (or admittance) quantities can, but while the 
scattering coefficients may not be particularly convenient for short or 
open-circuit computations, they may be applied in a relatively simple 
fashion when the network is terminated in a prescribed load impedance. 
Since we will be mainly interested in studying proper terminations for 
the system in order to realize certain desirable characteristics, scattering 
matrix formulation to describe AM and PM fluctuations in pumped 
nonlinear systems seems to be the most desirable.''*'*'** Equation (36) 
relates the AM and PM indexes or normalized voltages and currents 
at the input and output of a pumped nonlinear system.{ The incident 


£ Most of these concepts can be extended in a straightforward fashion if the 
pumped nonlinear system has more than two accessible ports. 
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and reflected AM and PM indexes (see Fig. 4) can, therefore, be written” 
as 


(m); = 3(m; +m), j=he, (38) 

(m,); = 3(m,; — mj), Jg=l,s, (39) 

(8:); = 26; + 4:4), j=1,s, (40) 
and 

(8,); = 2(8,; — 9:5), j=Ls. (41) 


Using (87) through (41), we can now obtain the following scattering 
matrix representation for a pumped nonlinear system: 


(ms) | gt So» | Om 
(m,)s = i a: (mi). y (42) 
(8,)1 s..' 8, || 
(8,)s.. (8.), 





The relation between scattering matrix in (42) and impedance matrix 
in (37) is easily derived.’* In our case, this is given by 
(S)i-e = La — 2{1a + (Zi-o)i-2} (43) 


where 1, is the unit matrix of order 4, and where Z’,_, is related to 
Zm-o in (87) by 


[1 0 0 0 1000 
0010 001 0 
(Zi-s)i-. = (Zipcode ‘ (44) 
0100 0100 
0001 000 1 





PUMPED 


NONLINEAR 
SYSTEM 





Fig. 4— Representation of AM and PM fluctuations in a pumped nonlinear 
system. : 
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The scattering matrix which relates the small-signal fluctuations of a 
pumped nonlinear system is, therefore, given by (43). We assume that 
the matrix 1, + (Z%_,):_-, is nonsingular. 

We would like to point out here that the matrices Sic, Sap, Spe, 
and S,, are all square matrices of second order. For reasons which are 
evident from (42), the matrix S,, will be referred to as AM scattering 
matrix, S,, as the AM-PM scattering matrix, S,, as the PM-AM 
scattering matrix, and S,, as the PM scattering matrix. 


V. SCATTERING MATRICES OF NOMINALLY DRIVEN LOSSLESS ABRUPT- 
JUNCTION VARACTOR FREQUENCY MULTIPLIERS{ 


The theory developed in the preceding sections will be utilized from 
hereon in order to obtain the scattering parameters of nominally 
driven lossless abrupt-junction varactor frequency multipliers. The 
elastance S(t) of the varactor diode as it is pumped is assumed to be 
given by§ 


S(t) = ps S, exp (jkwol). (45) 


k4#0 


In this section we shall first obtain the scattering matrix of a varactor 
doubler whose input and output circuits are tuned. In the later part 
of this section the scattering parameters of a tripler, whose input, output, 
and idler circuits are tuned, are also derived. The discussion of the 
_ scattering parameters of multipliers of higher order is postponed to 
later sections of this paper. For all the multipliers considered in this 
paper it is assumed that the bias circuit is properly designed so that 
there are no currents flowing at the sideband frequencies -kw.° Even 
though the currents flowing in the varactor are themselves not limited 
by the diode in the range of available frequencies we assume that the 
external circuits connected to the diode are such that they allow currents 
to flow in the varactor if and only if the frequency spectrum of these 
currents is in the vicinity of input, output, and idler carrier frequencies. 
This enables us to consider the multiplier as a finite port multifrequency 
system. 


tSee also Refs. 3 and 4 for alternate derivation of some of these results. 

§ The average elastance So of the varactor diode can always be included with 
the external circuit. The assumption that S) = 0 made in this section does not, 
therefore, involve any loss of generality. 
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5.1 Scattering Parameters of a Doubler. 


In a doubler, the only nonzero elastance coefficients are S.,, and 
S.i2. The impedance matrix (Z,-g)2 in (25) is represented as (see Fig. 5) 





Va 0 ers ers 9 [te 
Yn! |mara 9 0 TS (46) 
ise to : 0 : = 
Vn| 0 7( eer : , pe 


We shall now assume that input and output circuits are tuned which 
usually gives near optimum efficiency for a doubler.’’”’* We also assume 
that 


(6) 
a <i. (47) 


With these two assumptions, we can write the following matrix equa- 
tion for a doubler: 





7 
Vex 0 [St [St 9 Iz, 
Wo 2W9 

= 0 0 (48) 

Vue shel 0 0 0 las 
Wo 
V 2 0 _ 1S I 0 0 I p02 
Wo al 


VARACTOR 


DIODE 





Fig. 5—-Small-signal behavior of a varactor doubler. 
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The phase anglest of carrier voltages and currents are given by’ 


gv = O, (49) 

gn = 0, (50) 

Qve = 7, (51) 
and 

Yn = 7. (52) 


From (31) the impedance matrix (Z,_,)2 is, therefore, written as§ 


gee el ae 


Wo 20 
A. nlSel |S; | 
(Zo-s)a = wo ace (53) 
|S: | ? : 0 0 0 
0 
Wo 


Equation (53) clearly shows that in a doubler which is properly tuned 
and whose bias circuit is properly designed, there is no amplitude-to- 
phase and phase-to-amplitude conversion.° 

It is shown in Appendix A that 


a era (54) 


sae reps os (55) 
[I2| 4 | S2 | wo’ 


NRL) a (56) 


ee 20 : 


t Without loss of generality the phase angle of input carrier voltage at frequency 
wo is assumed to be zero. y72 is the phase angle of the current through the load con- 
nected to the doubler. 
§ The matrix (Z):_; will be written as (Z), in case this does not lead to any 
ambiguity. 
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and 


[Vel _ LS. on 


| J, | 20 


According to (37) and (53), the matrix (Z,,-»)2 is represented as 


(Zmaa)2 = (58) 


oN Oo 
oO 
So 
ms) 


The scattering matrix of a doubler, whose input and output circuits 
are tuned is, therefore, according to (43) 


! 0 
(Sa (59) 
9 19 -1 
ie al 
(Soa > fk a 3 (60) 
1 O 
(Sar) = 9, (61) 
(Spa) = Q, (62) 
and 
(Soa) = fe | (63) 
2. 4 


5.2 Scattering Parameters of a Tripler. 


In a tripler, carrier currents flow at the frequencies +wy , +2w. , and 
+3w,) . The impedance matrix (Z,-s)3 18 given as 


S. S# S; St a 











Vo) | 0 Gee Fa) Ton FS Ba Fo) We 9 | 

el entqecna © a ne 
Val lity 9 wre 9 

Vs i orc a ea _ oe 


SNOILVIGY DONIYALLIVOS 


LILT 
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If we now assume that 


2 <1 (65) 


Wo 


and also that input, output, and idler circuits are tuned and that’ 


gv = 0, (66) 

gn = 0, (67) 

Yv2 = 0, (68) 

gr = 0, (69) 

gys = 7, (70) 
and 

gis = 7, (71) 


we can show that the impedance matrix (Z,_,)s; does not exist for a 
tripler.t It is also shown in Appendix A that 


|S] = | 8, |/2. (72) 

Equations (64)-(72) show that§ 

a 2 

(ae (73) 

Va 3 

V7 2? (74) 

Tn 2 
and 

Vian 1 

Ve ve 


Equations (73) through (76) show that even though the amplitude-phase 
impedance matrix may not exist for a pumped nonlinear system (such as 


t The termination at the idler port just tunes out the average elastance of the 
varactor diode at the carrier frequency 2wo. 

§ We note that a tripler behaves like an ideal transformer of ratio 3/2 to the 
amplitude components of the fluctuations. Higher order terms in w/wo are as- 
sumed to be negligible, even when first-order terms vanish. The frequency- 
dependence usually introduced by the external idler termination, therefore, does 
not appear in the scattering matrix of the tripler. 


SCATTERING RELATIONS 1719 


a tripler) we are able to find a relation between the different terminal 
variables. These relations are sufficient to obtain the scattering param- 
eters of the network.’* We immediately note from (73) through (76), 
that there is no amplitude-to-phase or phase-to-amplitude conversion in 
a tripler. Accordingly, 








(Sir)3 = 0 (77) 
and 

(Spa)3 = Q. (78) 
It is shown in Appendix A that 

Vv, 3 

ryt 2 
and 

[nt _ 2. 

a (80) 


From (84), (85), (73) through (76), (79), and (80), we can show that 
1 
(Soa) = P (81) 


1 0 
and 
(Sy)3 > K vF | (82) 
3. 0 


Equation (81) could have been written down by noting that a tripler 
behaves like an ideal transformer of ratio 3 to the amplitude components 
of its fluctuations. 

The scattering parameters of a tripler, whose input, output, and 
idler circuits are tuned, are therefore, given by 


(S)s = eee ead (83) 


In order to obtain scattering parameters of multipliers of higher 
order with the least number of idlers we shall show that a multiplier 
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of order 2” is equivalent to a cascade of n doublers, a multiplier of order 
3° is equivalent to a cascade of s triplers, and that a multiplier of order 
2°3° is equivalent to a cascade of n doublers and s triplers. 


VI. EQUIVALENCE OF A MULTIPLIER OF ORDER N = N, X Ne TOA 
CASCADE OF TWO MULTIPLIERS OF ORDER NV, AND Nf 


In this section it is shown that, if the idler configuration of a multiplier 
of order N = N, X N, satisfies certain conditions, then the multiplier 
can be represented as a cascade connection of two multipliers of order 
N, and N, . In this and in the following two sections, no restriction is 
placed on the type of input, output, and idler circuits. Therefore the 
following discussion also applies to the case of a multiplier which is 
lossy and not tuned. 

Consider an abrupt-junction varactor multiplier of order N = 
Ni X N,. Let B denote the set of all positive and negative integers 
which are equal in magnitude to the orders of the various harmonics 
present in the varactor current. Furthermore, let B, indicate the subset 
of B which consists of the elements of magnitude equal to or less than 
N, , and let B, denote the subset of B which consists of the elements of 
magnitude equal to or greater than N, . In this section it will be shown 
that, if B satisfies the following condition: 


B is such that, if (7, s, h) is a subset of Bandifr+s+h = 0, 
then either (r, s,h) C B, or (7, s, h) C B., (84) 


then the multiplier is equivalent to a cascade connection of two multi- 
pliers of order N, and N,. Notice that an abrupt-junction multiplier 
of order N = N, X Ne which satisfies (84) must have an idler at the 
harmonic Ny. In fact, for such a multiplier this idler is necessary in 
order to produce harmonics of order higher than N wo." 

Consider then a multiplier of order VN, X N. which satisfies (84) 
and let it be represented by the very general equivalent circuit shown 
in Fig. 6. The generator v,(t) is sinusoidal and is of frequency w . Z(w), 
the impedance of the external multiplier circuit as seen from the non- 
linear part of the capacitance of the varactor, is assumed to be finite 
only in the vicinity of the input, output, and idler frequencies. Since 
Z(w) includes the average elastance and the series resistance of the 
varactor, the nonlinear capacitor of Fig. 6 has a g-v characteristic of 
the type: v = Aq’, in which A is a constant multiplier. Consider now 


+ The results of Sections VI, VII, and VIII represent sguuar'e of an earlier 
result, the equivalence demonstrated in Ref. 1 for the case N = 


SCATTERING RELATIONS 1721 


Aq? (t) 





Fig. 6— Varactor harmonic generator. 


the circuit shown in Fig. 7. It will be shown that this circuit provides 
an alternative and complete representation of the multiplier of order 
N, X N,. The two nonlinear capacitors of Fig. 7 and that of Fig. 6 
are assumed to have the same g-v characteristics. The three networks 
F,,F., Fs; are ideal filters which have zero impedances at the carrier 
frequencies which satisfy respectively the relations w < Nywy ,w = Nyw, 
w > N,w., and also at their sidebands. Furthermore, at frequencies 
different from these, they have infinite impedance. 

Before beginning the demonstration of the equivalence of the two 
circuits of Figs. 6, and 7, it may be profitable to examine briefly the 
behavior of the circuit of Fig. 7. The circuit of Fig. 7 represents the 
cascade connection of two multipliers of order VN, and N,. More pre- 
cisely, consider the network connected on the left side of the first ca- 
pacitor. For wo < Ny, it is equivalent to the network connected to 
the capacitor of Fig. 6. Therefore, it pumps at w = w, the first capacitor 
of Fig. 7 and, in addition, it provides the proper idler terminations for 
the generation of the harmonic N,w). A current component at this 
harmonic is therefore gencrated by the first capacitor and it flows in 
the second loop shown in Fig. 7. The second capacitor is thus pumped 
at w = N,w by this current. Note that the network connected to its 
right provides the proper idler terminations for the generation of the 


4 = IU L 442 L 943 
Vat 2o-at 2) at 





Fig. 7 — Equivalence of a multiplier of order NiNz to a cascade of multipliers 
of orders Ni and Ne. 
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output harmonic N,N.w,. Therefore, the second capacitor delivers 
power at this harmonic to the network on its right. 

Proof: First, consider the circuit of Fig. 6. The nonlinear capacitor has 
a q-v characteristic of the type: v = Ag’. Thus, V_,, the complex ampli- 


tude of v(t) at the frequency w = —hw,, is related to the various com- 
plex amplitudes of g(é) through the relation 
Vi=A DO QQ. (85) 
Rane 
By introducing the constraint given by the linear circuit at w = —hwp , 
one obtains 
Venn t jhenZ(Q1 =A 2 OQ., heB, (86) 
r,s)c 
rtst+h=0 


where V,,_, is the complex amplitude of v,(¢), and is zero for |h| # 1. 

Relations (86) give the equilibrium equations of the circuit of Fig. 6 
and they determine the various charge amplitudes Q, , Q2 , etc. Notice 
that in the summation of the righthand side of (86) one has r+s+h=0 
and (r, s, h) C B. Therefore, from (84) one obtains the following three 
cases: if |h| < N,, then (r, s,h) C B, ;if | h| = N,, then, depending on 
the values of r, s, either (r, s, h) C B, or (r,s, h) C Bo; if|h| > M,, 
then (r, s, h) C B,. Accordingly, (86) can be written as 


Vin + jhooZQ-.=A Dd) QQ, if [A] <M. (87) 


(r,s)CBy 
r+s+h=0 
HhooZQ-»= Al 2 QQ.+ 2 O03, Hf [hl = Ni (8) 
(r,s)CBy r,s)CBa 
r+s+h=0 r+s+h=0 
jhnZQ_,=A >> QQ., if [Al >M. (89) 
(r,s)CBe 
rtst+th=0 


Let now the circuit of Fig. 7 be examined. Consider the charges q,(t), 
ge(t) and q3(t) flowing through the three filters F, , F,, and F’; . Notice 
that q.(t) + q(t) is the total charge of the first capacitor, and that 
ge(t) + gs(t) is the total charge of the second capacitor. Because of the 
characteristics of the three filters F,, F,, F3, qi(é) + q(é) contains 
(all and) only the frequencies 7w) for which r ¢ B, . Similarly, g2(t) +¢q3(t) 
contains only the frequencies for which r ¢ B, . Now consider the total 
charge 


q(t) = alt) + g(t) + 9s(4) (90) 


and let the symbol ( )’ distinguish the complex amplitudes of q’(t) 
from those of g(é). It will be shown that g(f) = q’(é); more precisely, 
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it will be shown that the two circuits of Figs. 6 and 7 have the same 
equilibrium equations at the carriers. 


First notice that, for w = —hw, , the complex amplitude of the voltage 
of the first varactor of Fig. 7 is 
A do QQ! (91) 
Aare tly 


and that of the second varactor is 


Ad Qa. (92) 
(r,s)CBe 
r+s+h=0 


Next, notice that the equilibrium equations of the circuit of Fig. 7 
for |w| < Nw. are obtained by applying Kirchhoff’s law to the first 
loop of Fig. 7. Similarly, for || = Nw. , one considers the second 
loop and, for |w| > N,wo, one considers the third loop. Therefore, 
by taking into account (91) and (92), one obtains that the equilibrium 
equations of the circuit of Fig. 7 are given by (87), (88), and (89), with 
Q,, Q, replaced by Q/ , Q/ . Therefore, q/(t) = q(t). 

The preceding demonstration has shown that the two circuits of 
Figs. 6 and 7 are equivalent at the carrier frequencies. At the various 
sideband frequencies, the equivalence is demonstrated in very much 
the same way. Since the elastance coefficients of the two circuits are 
equal, one finds that the sets of small-signal equations of the two circuits 
are equal. 


VII. DISCUSSION OF THE TWO PARTICULAR CASES N = N, x 2 AND 
N = N, x 3 


In this section the two particular cases N. = 2 and N, =4 will be 
examined. More precisely, it will be shown that in these two cases 
condition (84) becomes: 


If N2 = 2, the two highest harmonics present In the varactor 
current are Niw. and 2N yw . (93) 


If N. = 3, the three highest harmonics are Nyw. ,2Nywo ,3NiwW9. (94) 


The demonstrations are very much the same in the two cases and there- 
fore, only the case N, = 2 will be considered. 
Proof: Consider the case Nz = 2 and suppose that (93) is satisfied. 
Then, consider the three sets B, B, , B, defined in the preceding section. 
From (93) one has B, = (—2N,, —N,,N,, 2N;). 

Now, consider a subset (7, s, h) of B and suppose that r + s + h = 0. 
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First, notice that (r, s, h) cannot belong to both B, and B, because this 
would give |r| = |s| = |h| = N,, which violates the hypothesis 
r+s-+h = 0. It is therefore, sufficient to prove that if (r, s, h) does 
not belong to B, , then it must belong to B,. 

Suppose therefore, that (7, s, h) does not belong to B,. Then one 
of the three elements 7, s, h has magnitude equal to 2N, and, since 
r-+s-+h = 0, the remaining two elements have magnitude equal to 
N,. Therefore, (r, s,h) C B,. 

The conclusion is that, if (r, s,h) C Bandr +s +h = 0, then either 
(r, s, h) C B, or (r, s, h) C B,. This concludes the demonstration. 


VIII. EQUIVALENCE OF A MULTIPLIER OF ORDER 2”3° TO A CHAIN OF 
DOUBLERS AND TRIPLERS 


Consider an abrupt-junction varactor multiplier of order N = 23° 
which has the least number of idlers. In this section it will be shown 
that this multiplier is equivalent to a chain of doublers and triplers. 
The order in which the various doublers and triplers are connected 
depends on the particular idler configuration. This will be clarified by 
the following demonstration which shows how to derive the equivalent 
chain of multipliers. 


Proof: Since the multiplier has the least number of idlers, there are two 
cases: either the highest idler frequency is Nw)/2, or the two highest 
idler frequencies are Nw/3, 2Nw/3. In both cases, the results of the 
preceding sections are applicable and therefore, the multiplier can be 
represented by means of a cascade of two multipliers of order NV, and 
N,. Note that N, is 2 in the first case and 3 in the second case. Note, 
furthermore, that if n + s = 2, then either VN, = 2 or N, = 3, and 
therefore the demonstration would end at this point. 

Ifn + s > 2, on the other hand, then NV, > 8 and the decomposition 
of the multiplier of order 2”3° into two multipliers of lower order can 
evidently be continued by the decomposition of the first of the two 
multipliers, the multiplier of order N, . If this process is carried as far 
as possible, the final structure will be a chain of doublers and triplers. 

It is important to point out that the results of this and the preceding 
section can be generalized in the following way: 


An abrupt-junction varactor multiplier of order N = N, X 

N2 X +++ X N, which has the least number of idlers can be (95) 
represented by a.cascade of n multipliers of order N,, No, 

etc., each with minimum number of idlers. 
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In fact, ifn = 2, then (95) follows directly from the equivalence dem- 
onstrated in Section VI because it can be shown that a multiplier 
which has the least number of idlers satisfies (84). 

If n > 2, then the multiplier can be decomposed into multipliers of 
lower order as it has been done for the particular case N = 23°, 


IX. SCATTERING RELATIONS FOR HIGHER-ORDER LOSSLESS ABRUPT- 
JUNCTION VARACTOR FREQUENCY MULTIPLIERS 


The scattering matrices of lossless abrupt-junction varactor fre- 
quency doubler and tripler are derived in Section V. Multipliers of 
order 2”, 3°, and 2"8° with least number of idlers{ are treated in this 
section. 


9.1 Multipliers of Order 2” with Least Number of Idlers 


A lossless abrupt-junction varactor frequency multiplier of order 2” 
with least number of idlers has been shown to be completely equivalent 
to a chain of n doublers. We shall assume in this section that the input, 
output, and all idler circuits are tuned, and that these idler terminations 
are lossless. The idlers are at frequencies 2. ,1 S r S (n — 1). The 
equivalence of a multiplier of order 2” to a chain of doublers can be 
utilized in getting the scattering relations for multipliers of order 2” 
when n > 1. The scattering relations when n = 1 are given in (59). 

We can show that a multiplier of order 2” with least number of idlers 
has the following scattering matrix: 





Pia - ACS ee a. wre | 
(} ae ne ! 0 
CS) acs Die ase Le Se COE (96) 
fa! Gely 
0 
bison 1 C1) xs 
L a (f ae \ 





9.2 Multipliers of Order 3° with Least Number of Idlers 


Multiplier of order 3° with least number of idlers has the idler cur- 
rents flowing at frequencies 29, 3w), 6wo, 9wo, «++ » 3° wy, 3) — 
37a, Biay, «++ , 3" — 8” wo. We have shown that such a multiplier 
is completely equivalent to a chain of s triplers. 


+It is assumed that all these idler circuits are tuned and that the idler termina- 
tions are lossless. It is also assumed that input and output circuits are tuned. 
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The scattering parameters of a tripler are shown to be 
0 li | 


(Se ea | ee (97) 


Oo | 
13 0] 


If s such triplers are cascaded, we obtain a multiplier of order 3° 
with minimum number of idlers. The scattering parameters of a cascade 
of s triplers can be shown to be given byt 


oe a 


(eS posses ! (98) 


1 


13° 04 
9.3 Multipliers of Order 2"3° with Least Number of Idlers 


It has been shown that a lossless abrupt-junction harmonic generator 
of order 2”3’, n and s integers, with least number of idlers is completely 
equivalent to a cascade of n doublers and s triplers connected in proper 
order depending upon the idler configuration. 

The AM and PM transfer scattering matrices’® of a doubler and 
a tripler can be shown to be 


Tas ' (99) 
0 1 


tole 
tol 
[eee atl 


C35 iS ; (100) 
(Laa)3 = be 4 (101) 
01 
and 
(Lno)3 = a | (102) 
os 


: t We have assumed that input, output, and all idler circuits are tuned, and loss- 
ess. 
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Since AM and PM transfer scattering matrices of a tripler are scalar 
matrices,’* values of (T'a.)2.3+ and (T'pp)o03+ are independent of positions 
of the triplers in the cascaded multiplier.t This shows that 





(Taa)orae = (T'aa)on (103) 
specie tl ees 
= |(-D2" 3-*3°2 
0 1 
and 
(Pyp)onge = 3 “(TL yp) 2 (104) 
(=1)" 0 
SN 
(—1)” 2. 1 —n —n 
ae 2 2 


Since we also know that there is no AM to PM and PM to AM con- 
version in both a doubler and in a tripler it follows that 
(Sap)on3¢ — (Spa)ange = 0. (105) 


Using (103) through (105), we conclude that the scattering param- 
eters of a multiplier of order 2"3° with minimum number of idlers are 
given by 


jE (se De Seer ea 
3 a | 0 
i 
(S)orse = ee ee eee easter te aaa nett - (106) 
: 0 (—1)"37° 
0 a 
longs L_ (= 1)"2" 
3 3 
or 
Got) ee ee ere 
GS) saa 4 3 3 2 ( 1) 2 (107) 
1 0 
and 
0 (—1)"37° 
(Spp)arse = (108) 


ngs fC Pai 
2°93 ar ia 2 


¢ Matrix product AB is not, in general, commutative." 
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Equations (106) and (107) show that the scattering parameters 
of a multiplier of order 2”3° are independent of the idler configuration 
of the multiplier. For example, 1-2-3-6-12 and 1-2-4-6-12 multipliers 
have the same scattering matrix. This result arises because of the scalar 
character of AM and PM transfer scattering matrices of a tripler and 
is not true in general. 


xX. RESULTS AND CONCLUSIONS 


A general method to obtain the scattering parameters of a pumped 
nonlinear system when the system is subjected to small band limited 
fluctuations has been presented. 

For a lossless abrupt-junction varactor frequency multiplier of order 
2” which has minimum number of idlers and whose input, output, and 
idler circuits are tuned, it is shown that the scattering matrix S is 
given by 











1 (—1)” —n — em 
37g 2) (Ue | 0 
1 O. .4 
CS oe: ae es ie ! iene ites - (96) 
0 ( 1)” 
L 3 2 ee 


Such a multiplier has also been shown to be completely equivalent 
to a cascade of n doublers. 

For a lossless abrupt-jJunction varactor harmonic generator of order 3° 
with minimum number of idlers and whose input, output, and idler 
circuits are all tuned it is shown that the scattering matrix S$ can be 
represented as 


eoeeee (98) 
| 
3° 0 | 

A multiplier of order 3° has been shown to be equivalent to a cascade 
of s triplers. 


However, for a lossless abrupt-junction varactor multiplier of order 
2”3° with minimum number of idlers it has been shown that this multi- 


{ The transfer scattering matrix (7')y,y, is not, in general, equal to (Z')y,n,. 
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plier is equivalent to a cascade of n doublers and s triplers, and that the 
scattering matrix S can be written as 





DE A op cote 

eg ee 0 
1 
\ 

(S)onse = ee: a a Se aad S220 ote fa ; (106) 
0 (—1)"3° 
0 | n 

Von 8 oon (—1) Qn 
(ast aie 


For lossless abrupt-junction varactor multipliers of order 2”, 3°, and 
2"3°, n and s integers, with minimum number of idlers, one of the general 
results is also that if w/w») « 1, there is no amplitude to phase or phase 
to amplitude conversion or equivalently 


Sag = Dus = 0. (109) 


The scattering matrices, of lossless abrupt-junction varactor multi- 
pliers of order different from those treated in this paper can be ob- 
tained by straightforward application of the methods presented in this 
paper. We, however, feel that most of the lossless abrupt-junction 
varactor multipliers commonly encountered in practice are covered in 
this paper. If the junction characteristic of the varactor diode is far 
from being abrupt or if the junction is overdriven, the same general 
methods can be applied in order to get the general scattering matrix 
which relates the fluctuations at different parts of the system. If the 
bias circuit is poorly designed so that there are currents flowing in the 
system at frequencies --w, the techniques developed in this paper are 
still applicable. 

At present very little is known about the stability of driven systems 
like harmonic generators. The results derived in this paper can be 
made use of in studying the stability of such systems and, in particular, 
in obtaining the restrictions imposed by the condition of stability on 
the available circuit configurations. This theory also enables us to de- 
rive an expression for the output signal of a pumped nonlinear system 
having noise sources at several locations in the circuit. A complete anal- 
ysis of the noise performance of the systems like harmonic generators 
can be carried out once we know the general scattering parameters of 
the system. All these and other related results are reserved for a future 
publication. 
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APPENDIX A 


Large-Signal Analysis of Abrupt-Junction Varactor Doubler and 
Tripler 

The large-signal equations of a varactor harmonic generator are 
given in Ref. 5. The varactor considered in this paper is a lossless var- 
actor whose average elastance So is considered to be a part of the ex- 
ternal circuit for the sake of convenience. Let S(t) be the elastance of 
the varactor as pumped. For an abrupt-junction varactor diode we 
also note? that 


jkw Sx, a i(k = LwoS,-1 eee eee ts J209S2 = ld Sy 
Ty re I, q, , 
k an integer. (110) 


The large-signal equations for a doubler can be written as 


_ SiTt + Stl, 


Jo 


Vi (111) 


and 

= Sil, ; 
i J2wWo 
It can be shown! that the time origin can be chosen so that J,, and I» 
are both real. From (110) through (112), we can now write 


G (112) 














V S 

ken ap 
29 eee es a 
ds 7 4 | So | wo , (114) 
V; Si 
cA = ied : (115) 

and 

V2 | Si | 

| ¥: = Deo, . (116) 





The large-signal equations for a tripler can be written as 


_ Sof + Salt + Stl, + S81; 


Jo 


Vi ‘ (117) 


* * 
y, = Self + Sih + Sie (118) 
J20 
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and 


= Sil, + SI, 


Vs J3W 


(119) 


It can again be shown that if we choose J; to be real and positive 
I, and Is are real. Let us assume that the idler termination is tuned 
and is lossless. From (118) we can write 


| Ss | = |S: |/2. (120) 
According to (110), (117), (119), and (120), we also have 


tel ie (121) 





\Je| 3 
and 

Vil} _ 3 

| Vas (122) 
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Effect of Redirectors, Refocusers, and 
Mode Filters on Light Transmission 
Through Aberrated and Misaligned Lenses 


By E. A. J. MARCATILI 
(Manuscript received June 16, 1967) 


The field distortion of a beam propagating through a sequence of identical, 
misaligned and slightly aberrated lenses is calculated as a perturbation 
of the Gaussian beam that would propagate in the absence of aberration. 

It is found that most of the converted power goes to the first and second 
modes. They produce deflection and spot-size change of the ideal beam, 
respectively. The power coupled to modes higher than the second deform 
the Gaussian profile. 

In general, the mode conversion per unit length of guide can be reduced 
by making the spot size small and by avoiding in-phase coupling at every 
lens. This last condition is achieved by choosing the period of oscillation 
of the beam different from an integer number of lens spacings. 

Before the beam becomes too distorted, the converted modes must be 
eliminated. Power in the first and second modes can be reconveried loss- 
lessly to the fundamental Gaussian beam by means of servoloops that 
redirect and refocus the beam. If refocusers are not used, the power in 
the second mode, as well as the power in the higher-order modes must 
be absorbed in mode filters such as irises. 

For lenses with fourth-order aberration such that at a beam half-width 
distance from the center the focal length departs 6 percent from ideal, 
the following typical results are obtained: 

In a guide in which the distance between the beam and guide axes is 
a constant plus a sinusoid, the converted power is proportional to 4’, 
to the fourth power of the amplitude of the sinusoid and to the square 
of the number of lenses, but is roughly independent of the curvature of 
the guide axts. 

On the other hand, in a guide in which the distance between the beam 
and guide axes ts a constant plus a random quantity the converted power 
is proportional to 5°, to the square of the guide curvature, to the mean 
square of the random deviation, and to the number of lenses. 
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For 6 = 1 percent, a 1 percent power conversion to the second mode 
occurs in typical examples, after a few tens of lenses, and the order of 
magnitude of mode conversion is 0.001 dB/lens. Most of that power 1s 
in the second mode and can be recovered with refocusers. 


I. INTRODUCTION 


Sequences of widely separated glass lenses or periscopic mirrors,’ 
as well as sequences of low loss closely spaced gas lenses,?* have been 
proposed as beam waveguides for long distance optical transmission. 

The theory describing the wave and ray propagations through a 
sequence of misaligned, thin, perfect lenses is known, and those results 
are applicable even if aberrations are present, provided that the num- 
ber of lenses is small. Nevertheless, when that number is large, the 
cumulative effect of lens imperfections must be included. 

Before introducing aberrations, though, let us briefly review what is 
already known about wave and geometric optics in a beam waveguide, 
assuming throughout the two-dimensional problem instead of the 
more realistic and complex, but not more enlightening, three-dimen- 
sional one. 

A paraxial Gaussian beam launched in a periodic sequence of 
identical thin, aberration-free, but misaligned lenses**7* conserves 
throughout the Gaussian transverse field distribution. The spot size 
depends on the initial conditions, the focal length f of the lenses and 
their spacing ZL, but is independent of the lens alignment and does not 
grow with the number of lenses. The geometry of the beam axis, on the 
other hand, depends also on f, Z, and the initial conditions, but more 
important, it depends on the alignment of the lenses. In general, the 
beam axis oscillates about the guide axis and the amplitude of the 
oscillations increases with the number of lenses. For a given set of 
lenses through which a beam is to be guided, there are then alignment 
tolerances which must be satisfied in order to prevent the beam from 
hitting the edges of the lenses. Those tolerances can be drastically 
alleviated by using redirectors,® that is, servoloops that realign the 
beam axis with the guide axis. 

Nevertheless, if the lenses have aberrations, the beam does not con- 
serve the Gaussian profile, but deforms itself*® 11-17-13 as it travels along 
the guide, the definition of the beam axis then becomes fuzzy, the re- 
directors become less and less effective, and eventually the grossly dis- 
torted beam hits the edges of the lenses. 

Gloge’* has found the effects of random aberrations such as those 
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which occur both in glass lenses and in the controlled atmosphere be- 
tween them. Because of the randomness of the aberrations, the beam 
distortion is independent of the beam trajectory. On the other hand, 
if all the lenses have the same aberration such as in gas lenses, the 
beam deformation is strongly dependent on the relation between the 
beam and the guide axes. 

This paper gives an estimate of the beam deformation as a function 
of systematic aberrations, lens misalignments, and presence or absence 
of redirectors. It also suggests ways of preventing the beam deteriora- 
tion together with their price tags. 


II, WAVE TRANSMISSION THROUGH SLIGHTLY ABERRATED AND MISALIGNED 
LENSES 


We begin reviewing the wave transmission through ideal lenses and 
afterward the lenses are slightly perturbed and the mode conversion is 
calculated. 

The wave transmission through a sequence of ideal, thin, equidistant 
and misaligned lenses as those shown in Fig. 1 is known.5:%7:8 

The guide is completely defined by the focal length f of the lenses, 
their separation L and the radius of curvature R, of the guide axis at 
every lens. 

The beam axis is characterized by its distance s, to the guide axis 
at the nth lens. If the beam is launched through the center of the first 
lens, it is known? that s, is related to L, f, and R, by 


“ ee GUIDE 
“er 4 7 AXIS 






LENS N-t JALE 


GAUSSIAN 


Fig. 1— Beam transmitted through misaligned ideal lenses. 
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L? Ssin (n — m)6 
= a 1 
Ss sin 6 “aI Rn ' (1) 
where 
L 
Pete ean 2 
cos @ l 5; (2) 


The field distribution at every equiphase surface is Gaussian, and its 
width varies along the beam. Nevertheless, if properly launched, the 
beam maintains the same half-width w and the same radius of curva- 
ture r of the wavefront at every lens. As depicted in Fig. 1, the beam 
between surfaces a,-; and 8, is the same for all n. 

Proper beam launching is achieved if, at the first lens, the radius 
of curvature of the wavefront is 


p= J; (3) 


and if the beam half-width w is related to the wavelength A and the 
guide parameters in the following manner: 


=~ ,/-h. — , | Lz 
Oe Narsin d eoeee (4) 


Assuming the lenses to be two-dimensional, then the Gaussian beam 
is also two-dimensional and the electric field measured along the 
circles a, OF Bn is 


E = D,(2¢) = «* (5) 


and 
x 
eae (6) 


Strictly speaking, the normalized length — measured along the sur- 
faces a, or 8, does not coincide with x/w, but if the beam is paraxial, 
the discrepancy is negligible. 

If a higher mode is launched with the same wavefront curvature 
1/r and the same width w, the field distribution at every surface a, , Bn 
is described by the parabolic cylinder function” 


D,(2&) = e-* He,(2£) (7) 


in which He,(2¢) is the Hermite polynomial of order p. Between the 
surfaces a, and £,,,, the phaseshift of the pth mode is pé radians 
smaller than the phaseshift of the fundamental Gaussian mode. 
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Now let us calculate the phaseshift in passing from the equiphase 
surface 8, to the equiphase surface a,, Fig. 2. The length AB of a 
typical ray path between those two wavefronts is 


— 2 
AB =x 2+. 
f ? 
Calling 
pie (8) 
w 
and using (3), (4), and (8), we obtain 
AB = * (20,¢ + #) tan 2. 
T 2 
The total phaseshift in passing from surface 8, to a, is 
0 
or(on, + &) = 2(20,¢ + £) tan ak ®,(o, + &). (9) 





Y— +*--LENS CENTER 


NTH LENS 


\ 


Fig. 2— Phase front change through a lens. © 
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It is made essentially of two terms. The first is the phase due to the 
path AB, the second, ®,(c, + £), is the phase contributed by the lens. 
Only if the lens is ideal the surface a, will be an equiphase, and ¢,(¢, + &) 
is a constant. The phaseshift through a perfect lens is then 


®, ideai(o, + £) = const. — 2(2c,¢ + £) tan 5 (10) 
and since the constant introduces only an uninteresting uniform phase- 
shift, we will call it zero throughout. 

On the other hand, if the lens is not perfect, ¢,(c, -++ £) is not a 
constant and the field on the surfaces a, and 8, can no longer be de- 
scribed by a single mode, but rather by a superposition of modes 
as those given in (7). In general then, the field on the surface a, is 


Hla) = Y anD.28). (11) 


The amplitude a,, of each mode has been calculated in (57) under 
the assumptions of small lens distortions and small beam departure 
from ideal, that is, 


leon + £) | Ka (12) 
and 
laa |<«K1 for g>0. (13) 
That amplitude is 
inc Dieu (14) 
y=l1 


in which c,,, is the coupling coefficient between the fundamental and 
the gth mode at the vth lens, and its value, derived in (58), is 


] if qg=0, 


ae ee ate, (o,) 

2°q! X 2*ul  da,7*7* ee ae 
Within the approximations involved, the fundamental mode has ampli- 
tude one throughout. The amplitude a,, of the gth mode immediately 
after the nth lens, (14), is made up of n terms. All of them have simple 
physical meaning. Consider the vth term. The fundamental mode 
couples C,,, into the gth mode of the rth lens and this travels up to the 
nth lens without further conversion; its phaseshift with respect to the 
fundamental mode is q(v—7)8@. 


(15) 
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Since co.» = 1, the reconversion into the fundamental mode is not 
calculated explicitly. Nevertheless, it is automatically taken into ac- 
count when the power in the higher order modes is ascertained. 

The amplitude of the coupled mode ag, can be maintained small 
by techniques well known from coupled waves theory and which will 
be used later on: 

(zt) Selecting the phase at the coupling points to provide destructive 
interference. 

(iz) Dissipating the power in the unwanted mode. 

(ww) Providing mode transformers capable of changing unwanted 
modes into the fundamental one. 


III. RECOVERY OF THE FUNDAMENTAL MODE 


Physical interpretations of the deformation of a beam traveling 
through aberrated lenses and ways of preventing that deterioration 
are considered next. 

The field (11) after the nth lens is made essentially by the funda- 
mental mode slightly modified by higher-order modes. Neglecting 
powers of dn bigger than one, because of (13), and grouping the first 
three terms, 


Boy a Ge Die 290) = 3a, > dnD,(24). (16) 


The first term is a Gaussian beam different from the ideal one. Its 
axis is at a distance 


c, = Rea (17) 
from the beam axis of Fig. 1, and its half-width is 
w, = 1+ Re 2a, . (18) 


Both dimensions are normalized to the ideal beam half-width w. 
Furthermore, the angle between the two axes is 


6, eas a Im Qin (19) 
rw 
and the radius of curvature of the wavefront results 


tan — 


r= of _ am Im “| (20) 
2 
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Expressions (17) through (20) are valid as long as 


| Qin [<1 (21) 


| don | K1. 


These inequalities can be satisfied for an arbitrarily large number of 
lenses by periodically realigning the beam and changing its width to 
the proper size. 

The realignment of the beam can be made with redirectors.® A feed- 
back servoloop as shown in Fig. 3 senses the position of the beam with 
photosensors p; and pe. which are centered, for example, on the axis of 
the pipe in which the lenses are housed. 

The difference signal from the sensors is amplified in A and used to 
displace lens n — 1 laterally until the beam axis passes through the 
center of the photosensors. In general, at every servoloop the beam’s 
axis will pass through the center of the sensor. 

The beam size and the curvature of the wavefront can also be ad- 
justed with servoloops which we will call refocusers. The principle of 
operation is shown in Fig. 4. The beam is aligned with three lenses 
and three photosensors p,, 2, and p3, are placed at distances from the 
axis such that an ideal beam would produce equal signals. The dif- 
ference signal between p, and pe is amplified in A and controls the 
focal length of lens n — 1, while the difference signal between pe and pz 
is amplified in B and used to change the focal length of the nth lens. 
Once these differences are small, the three signals from the photo- 
sensors are practically identical and the beam coincides with the ideal 
one. 

Beam size correction is also possible changing the distance between 
lenses instead of their focal length. 

Obviously, if the lenses are three-dimensional instead of the two- 





Fig. 3 — Redirector:servoloop for beam realignment. 
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Fig. 4— Refocuser:servoloop for beam-width adjustment. 


dimensional considered above, similar servoloops must be used in 
two perpendicular directions. 

For gas lenses of the tubular type,? the beam deflection and focusing 
may be achieved by dividing the tube in four sectors (Fig. 5) and con- 
trolling the temperature T of each of them.7° If T; = T. = T3 = T4, 
the lens focuses only, but if Tz = T3 = Ts < 1, the lens focuses and 
deflects the beam downward. If 7; = T3; > T. = 74, the focusing in 
the vertical direction is stronger than in the horizontal direction. 

For focusing devices such as periscopic mirrors? (Fig. 6), the deflec- 
tion of the beam may be achieved by rotating one or both mirrors 
around perpendicular axes x and y. As suggested by R. Kompfner, 
beam refocusing may be obtained by mechanically deforming the 
mirrors in the two perpendicular directions. 

The beam losses in the process of beam refocusing and redirection 
are due to the interception of the beam by the photosensors, and, in 
principle at least, they can be made very small indeed. These devices 
then operate on the idea of reconverting higher unwanted modes into 
the fundamental. 

Unfortunately, it is not simple to make a mode converter capable 
of taking care of the modes higher than the second contained in the 
summation in (16). For them and also for the second mode, if re- 
focusers are not used, S. E. Miller suggested another technique which 


COOL 
GAS 





Fig. 5— Tubular gas lens for beam realignment and refocusing. 
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BEAM AXIS 


rr 


Fig. 6 — Periscopic mirrors. 


is essentially lossy, but may be simpler to implement. It consists of 
using mode filters, perhaps irises aligned with the centers of the re- 
directors’ photosensors. 

What are the powers involved in these filtering schemes? At the nth 
lens, the power in the second mode normalized to that in the funda- 
mental one is 


Pp» = 2 Qon ig (22) 


If dz, is real, the radius of curvature of the wavefront (20) coincides 
with the ideal one 2f, while the half beam-width departure from ideal 
results from (18), 


lw, —-1| = V2P. (23) 


At the same lens, the power carried by the higher-order modes is 


Pr = Qi | den |’ a). (24) 
q=3 
There is another filtering scheme which consists of using at every 
lens redirectors and filters capable of absorbing the second- and higher- 
order modes. The power absorbed by the filters in the n first lenses 
normalized to the power in the fundamental mode is 


P= 22 25d! | tale). (25) 
v=] g=2 
In the following section, several examples are considered and a 
comparison is made between the powers P‘”, P”, and P{ to find 
the most efficient way of avoiding beam deterioration. 


LIGHT TRANSMISSION 1743 


IV. EXAMPLES 


Let us assume that all the lenses are imperfect but identical, and 
that the aberration is of fourth order. The phaseshift introduced by 
the aberration 


9.(o,) = boy (26) 


means physically that at a beam half-width w from the center of the 
lenses, the phaseshift due to aberration is § radians, while the ideal 
phaseshift (10) is —2 tan 6/2. Another physical interpretation is 
provided by the focal length 


fo) = 1+ oe o3) (27) 


calculated from (10). At a distance w from the center, the focal length 
is 8f/tan 6/2 longer than ideal. For gas lenses,? a typical value for 
8 is 0.01.17 

Then, the coupling coefficients between the fundamental and the 
higher-order modes at the vth lens (15) are 


#2.8(1 + 40”) 


Co2v 


Cosy aoa te CO, 
: (28) 


Coty => 


‘16 
Coq = 0 for gq>4 


and the amplitudes of the different modes after the nth lens (14) turn 
out to be 


Q 
Ny 
3 

I 


125 DS (1 ots dee 
y=1 
a .6 = 13(vp—n)0 
A3, — 4 2 > o,€ 
oF (29) 


16 = t4(p—n)0 
Qan = Tr é 
an 16 > 


Qa = 0 for g> 4. 


To assign values to o, we consider two examples. 
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4.1 Beam Guide with Curved Axis 
The lens centers are on a circle of radius R and the beam axis 
intersects the rth lens at a distance 
Oo, = ho +- hy cos vé (80) 


from its center. This means that the beam axis oscillates sinusoidally 
with amplitude h, about a circle of radius R + how. 

The constant h; depends only in the beam launching conditions, 
while ho is related to the other parameters of the guide’® by 


fo ce 
7 Rat? 6/2 > wR Gi) 


Substituting (29) in (22) and (24), as well as (26) in (25), assuming 
6 to be of the order of 7/2 and neglecting terms that do not grow with 
n, the following powers are derived: 

)y 


; sin B/2né\" (hi 1) sin” an ‘ 
° | (h sin 3/20 TAs 16/ sin’ 26 (82) 


2 
pelt + 10(1é + iy + 12h3 + 36Achi 


ho 


sin 3/2n6 


sin. sin 2n6 
sin 3/26 sin 20 


PS ao 4 














+f (n4 


Pp? iz 


| 
bole 


3 
po 


Il 


sin 2n6 
sin 26 


sin 3/2n6 tn 
sin 3/26 ‘ 
To minimize these quantities, one must choose the distance L between 


lenses in such a way that 6 is different enough from 7/2 and 27/8 as 
to satisfy the inequalities 








oF ani(s aI ) + 12hohi 











and | (83) 





This choice prevents the systematic in-phase coupling of higher-order 
modes at every lens. This result can be extended to guides of identical 
lenses with any aberration. The separation LZ must be chosen in such 
a way that the period of oscillation of the beam about the axis does not 
coincide with an integer number of lens spacings. 
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If (33) are satisfied, the powers of (32) become 


PY = Sekhin’ (34) 
PP?’ =0 (35) 

2 
P® = po'[ + io(n8 + ii) + 12h5 + 36hcn7 + ant fu (36) 
Since P” = 0, there is no build-up of power in the third and fourth 


modes. The beam maintains a Gaussian profile and can be refocused 
without any power loss. 

The power in the second mode grows proportionally to the square 
of the number of lenses and to the fourth power of the amplitude of 
the beam axis oscillations, but is independent of h, and consequently 
independent of the radius of curvature R of the guide. 

If absorbing mode filters are used, one observes that, while P‘” 
grows proportionally to n’, (84), PS? grows only proportionally to n. 


There is cross-over at a number of lenses n, for which PS? = P®?. It is 
2 
nee = [1 4 10( i fa ui) 4 12h! + 36n2h2 + ant] (37) 
1 


If n < mp it is less power consuming to have one filter every n lenses. 
If n > ny it is better to use filters at every lens. 


For 
ho = 1,* 
hi = 1, and 
6 = 0.01, 


we calculate from P<” in (84) that the power converted to the second 
mode is 1 percent after 19 lenses. 

Furthermore, one mode filter at the 19th lens, or filters at every 
lens, would dissipate, respectively, 


Py = 0.01 equivalent to 0.0023 dB/lens 


and 
P{& = 0.049 equivalent to 0.011 dB/lens. 
* This value ho = 1 is derived from (81) using the following typical quantities: 
L=0.7m 
R=1km 
w= 0.5 mm 
0 = x/3. 
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It is roughly five times less power consuming to use one filter every 
19 lenses. This occurs because the conversions at successive lenses 
have enough phaseshift to interfere destructively and reduce the 
converted power level from 0.011 dB/lens to 0.0023 dB/lens. 

Given the lenses with aberrations and a length of guide D, is there 
less mode conversion crowding the lenses or keeping them far apart? 
To answer this question, (84) is rewritten substituting for n the ratio 


D/L, 
a _ (9 chi ; tan’ 6/2. 

Pow = (3 tan” 6/2 ” L (38) 
Because of the normalizations (8), (27), (30) to the ideal beam size 
w, the parenthesis is a constant and P{), is minimized by making 
tan’ 6/2/L’ as small as possible. From (2) 








This expression and consequently the power P5/, is independent of 
the wavelength \ and can be minimized by choosing the separation 
of the lenses as close to confocal as possible without violating the 
inequality (33). 

Following a similar line of thought one deduces from (82) that 
the power P\”’, in modes higher than the second, is reduced by choosing 
\ as small as possible. This result is illustrated next via a computer 
experiment’’ that goes beyond the limits of applicability of the per- 
turbation analysis developed in this paper. 

Consider a sequence of aberrated, aligned, two-dimensional lenses 
of width 2a, spacing L, and focal length 


j= fo + o.02(2) |. (39) 


A Gaussian beam of half-width w, which is the correct spot size 
for a sequence of ideal lenses of focal length f = fo, enters parallel 
to the guide axis at a distance a/3. Assuming 

L = 2f, (confocality) 
and 


’ 


Ris 
wil 


the distorted power beam profiles at the 167th and 168th lenses are 
illustrated in Fig. 7(a). 
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Fig. 7—Power beam profile after many lenses of focal length f = fol! + 
0.02(s/a)2]. (a) L = 2f (confocal lenses); w/a = 3. (b) L = 1.8f) (10% off confocal); 
w/a = 1/34/93 (shorter wavelength). 


Conversion to distorting high-order modes is substantially reduced 
by avoiding confocality and by reducing the beam width. For example, 
assuming 

L = 1.8f5 
and 

ee 

a W283 
the power beam profiles after the same length of guide, Fig. 7(b), 
are still close to Gaussian. 
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4.2 Beam Axis Randomly Dispaced from a Circle 


This beam axis occurs, for example, in the following beam guide. 
Assume a metallic tube in which the lenses are housed and whose axis 
is a circle of radius R. With each lens there is a redirector rigidly con- 
nected with the tube. To keep the beam away from the wall, the photo- 
sensors’ centers should coincide with the tube axis, but they don’t 
because of alignment tolerances. At the nth photosensor, their separa- 
tion is a Gaussian random length d, , Fig. 8, which we have normalized 
to the beam half-width w. 

The beam axis forced to pass through the center of every photosensor 
will also depart from the tube axis d, . 

From Fig. 8, one finds that d, , R, L, and f are related to the normal- 
ized distance o, between beam axis and lens center by the expression 


on, = 2 dn = Cigsi —_ dasa + ho (40) 


in which hy is once more the constant defined in (81). 
Since d, is a Gaussian random variable, it follows from (35) with 
obvious nomenclature 


(o) = No (41) 

(0°) = 64") + he (42) 

(o*) = 108(d’)? + 36h5(d7) + hi . (43) 
oa ee (In dn) 
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Fig. 8— Beam axis at random distance d, from a circle of radius R. 
On = 2dn —dn—1 — dnt + ho; ho = Lf/wk. 
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Now we calculate from (22), (24), and (29) the expected power in 
the second mode and in the higher-order modes at the nth lens 


(Pr) = 38((o") — (o’)’)n (44) 
‘ al cae P 1 /sin 2n6\’ ( sin s/ane\ 
(yy _ 3 gas Pesirad) (peace aped reid de ee - 
ee ate |) Oe 16 (& a) + (Ro sin 3/20 (45) 
The expected power to be absorbed by mode filters at every lens is 
deduced from (25) and (28). It is 
(PX?) = 36°(1 + 10(07) + 12(0%))n. (46) 


More explicit results are obtained substituting in the last three 
expressions the averages (c), (c°), and (o*) with their equivalents 
in (41) through (48): 


(P?) = 1088%d?)(h + 3(da”))n (47) 
(2) 92/72 5 ( sin zn 3 ( sin B/2né\! 
Fe po SOE Mh res 0 og) Fo \O ana /20 (48) 


(P.) = 38° [1 + 10K6 + 12h5 + 12(d*)(36h5 + 5) + 1296(d)"In. (49) 


The power in the second mode grows proportionally to the number 
of lenses, and if h2 >> 3{d’), is proportional to the mean square dis- 
placement and also proportional to hj. 

To prevent build-up of (P) proportionally to n’, it is necessary 
to avoid choosing 6 = 27/3 or 6 = 7/2. 

In general, (P*”) «< (P{); therefore, it is less power consuming 
to use beam refocuser and mode filters after several lenses and not 
at every lens. 

For 


it 
5 = 0.01 
VP) = 0.1 


an expected power conversion to the second mode of 1 percent occurs 
after 90 lenses. At that lens (47), (48), and (49) become 


(Ps) = 0.01 equivalent to 0.00048 dB/lens 
(P§3) = 0.0008 equivalent to 0.00004 dB/lens 
(P§) = 0.095 equivalent to 0.0047 dB/lens. 


1750 THE BELL SYSTEM TECHNICAL JOURNAL, OCTOBER 1967 


If only mode filters every 90 lenses are used, the loss is 0.00052 
dB/lens. If beam refocuser and mode filters are used every 90 lenses, 
the loss is 12 times smaller, 0.00004 dB/lens. 

Differently from the previous example, the conversion per unit 
length is reduced by choosing both the separation L between lenses and 
the wavelength A as short as possible. 


Vv. CONCLUSIONS 


A beam transmitted through few tens of identical misaligned and 
aberrated lenses becomes distorted due to coupling to unwanted 
higher-order modes. Unless the beam is restored to ideal shape, the 
distortion continues until power is lost through the edges of the lenses. 

In general, mode conversion per unit length of guide is minimized 
but not eliminated: (7) by choosing the distance between lenses such 
that the period of oscillation of the beam does not coincide with an 
integer number of lens spacings; (27) by reducing the spot size, that is, 
by using short lens spacing and wavelength. 

Most of the converted power goes to the first and second modes. 
They change the beam path and the beam size, respectively. In prin- 
ciple, both can be corrected with negligible loss by means of servo- 
mechanisms which redirect and refocus the beam. 

Power converted to higher modes than the second distort the wave- 
front and must be absorbed by mode filters such as irises, for example. 

For lenses with fourth-order aberration such that at a beam half- 
width from the center the focal length is 1 percent shorter than on 
axis, a 1 percent power conversion to the second mode occurs after 
few tens of lenses. Mode filters every few tens of lenses restore the 
original beam with losses of the order of 0.001 dB/lens. If refocusers 
and filters are used simultaneously, the second-order mode power is 
recovered and the losses are reduced by one order of magnitude. 

Mode filters at every lens are, in general, lossier. 

Long distance transmission through aberrated lenses such as our 
present form gas lenses seems possible, but it hinges heavily on our 
ability to build efficient and reliable redirectors, refocusers, and mode 
filters. 


APPENDIX 
Field in a Sequence of Distorted Lenses 


The field on the surface a, (Fig. 1) is made of a superposition of 
normal modes 
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Blan) = Qo dnD,(28). (50) 
This field is related to the field H(8,) on the surface 8, by the phase- 
shift ¢,(o,+€) given in (9). Thus, 


Ean) = E(B,) exp [tgn(on + §)]. (51) 
Furthermore, the field #(8,) on the surface , is related to that on the 
surface an_1 through the phaseshift of every mode, 


E(6,) = ps dani exp (—196)D, (28). (52) 


Substituting (50) and (52) in (51), we obtain 


D aenDa(28) = exp lieu(on + 01D dax-vD«(28) exp (ig8) (68) 


and because of the orthogonality property of the parabolic cylinder 
function 


Gan = 2s Apn—1Cpqn EXD (—7ipé), (54) 
p=0 
where 
[exp fiex(on + 91D; (28) D,(28) dé 
Cyan = SS (55) 
[Dies ag 
is the coupling coefficient between the pth and gth mode at the nth lens. 


We are interested only in small lens distortions and small beam 
departure from ideal; therefore, 


leon +&) | Kr 
Qan<K1 for qg>0. 


Accordingly, keeping only first-order perturbation terms and expand- 
ING ¢vn(o,+€) in Taylor’s series, we obtain for (54) and (55) 


(56) 





Aan = De Cogy EXD [7g mz n) 6] (57) 
y=1 
and 
1 if g=0, 
Cogr = ° ee) at2p (58) 
Z oF 1 al '¢.(o,) if g>Q0. 


2°q! u=0 2**u! dor**# 
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Communications and Radar Receiver Gains 
for Minimum Average Cost of Excluding 
Randomly Fluctuating Signals 
in Random Noise 


By STEPHEN S. RAPPAPORT 
(Manuscript received October 25, 1966) 


The problem of automatic gain control ts approached from a statistical 
point of view. A simple generic equation is found whose solution yields 
the required receiver gain or attenuation for minimum average cost of 
excluding (from the receiver’s limited dynamic range) randomly fluctuating 
signals in random noise. A canonical phase-incoherent link is considered 
and the resulting transcendental equation is solved using an iterative 
technique. The analysis and the results obtained apply to both linear and 
nonlinear incoherent receivers including those of the logarithmic or lin-log 
type and to a range of fluctuation models including Rician, Rayleigh, 
and nonfluctuating cases. It 1s shown that the optimum receiver gain ts 
relatively insensitive to the ratio of costs of saturation at the wpper and 
lower dynamic range bounds, differing at most by about 8 dB from the 
optimum for the equal cost (minimum exclusion probability) case for 
typical parameters. The effect of noise introduced by the gain adjustment 
cascade ttself 1s discussed. 

The results, presented in concise normalized form, are applicable to 
a wide range of signal, noise, and channel conditions and have important 
amplications for communications through fading channels as well as for 
radar observation of fluctuating targets. 


I. INTRODUCTION 


Since the ability of both communications and radar receivers to per- 
form satisfactorily can be seriously degraded when the signal ampli- 
tude does not lie within the dynamic range of the receiver, the setting 
of receiver gain to minimize or prevent saturation at the upper and 
lower dynamic range bounds is an important problem. The problem 
arises in various forms. In simple receivers, the gain might be fixed 


1753 


1754 THE BELL SYSTEM TECHNICAL JOURNAL, OCTOBER 1967 


to optimize performance for nominal signal and noise parameters. 
More complicated receivers can adjust the gain automatically by any 
of several methods. The most common AGC circuits, for example, use 
a time averaged baseband signal as an indication of signal strength. 
Another possibility is to have the gain adjusted on command from a 
digital computer. This latter configuration has important implications 
for communications terminals which can use sophisticated techniques 
for estimation of signal and noise parameters as well as for certain 
radars which must observe from look-to-look radar targets of differ- 
ent cross-section which have been illuminated by various transmitted 
waveforms. The analysis presented here does not depend on the 
particular configuration and is applicable to both linear and non- 
linear receivers including those of the logarithmic and lin-log type. 
The application to a nonlinear receiver can be accomplished by re- 
ferring the overall dynamic range of the signal processing chain to 
a point before the nonlinearity. 

In a recent correspondence, Ward? determined the placement of 
dynamic range bounds to minimize the probability of excluding a 
Rayleigh distributed signal. This was extended by Rappaport? who 
determined dynamic range bounds for minimum probability of exclud- 
ing a signal from the dynamic range of incoherent radar or com- 
munications receivers. The viewpoint taken there? considered ran- 
domness due either to background noise or target fluctuations (channel 
fading). This paper considers several further generalizations of the 
problem. The case in which the randomness is due to both causes 
together is treated. In addition, the criterion for optimization is taken 
as the minimum average cost of exclusion. The required receiver 
gains as well as the optimum dynamic range bounds are determined. 

The present paper proceeds from the specific to the general. That 
is, first the determination of optimum dynamic range bounds for 
minimum exclusion probability with non-fluctuating target (no chan- 
nel fading) is presented. 

The criterion is then generalized from minimum exclusion prob- 
ability to minimum average exclusion cost; the former being a special 
case of the latter. Finally, dynamic range bounds and receiver gains 
for minimum average exclusion cost in an environment of fluctuating 
targets or channel fading is determined. It is assumed throughout 
that the signal, noise, and fluctuation parameters are known to the 
receiver. By letting the parameters involved assume certain values the 
relations for the fluctuating case reduce to the nonfluctuating case. 
Hence, the general treatment presented here includes either criteria 
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and either the case of fluctuating or nonfluctuating SNR. Rician and 
Rayleigh SNR fluctuations are considered. 

Fig. 1 shows a model for an incoherent radar or communications 
link. The blocks labeled K,, Ke, and Kg represent variable gain de- 
vices (perhaps, variable attenuation pads) whose total gain K = 
K,K2Kz3 is to be adjusted so that the random signal appearing at (EZ) 
is in some sense confined to a specified range. The model used for 
the propagation medium and/or target is described in Section IV. 
Extension of the explicit results obtained here to an important class 
of nonlinear receivers by conceptually including a zero-memory 
nonlinear device between points (D) and (£) will be described subse- 
quently. The other blocks in the figure require no further explanation. 
The figure is presented so that the reader can obtain a clear under- 
standing of where in the signal processing chain various quantities 
arising in the following analysis are being determined. However, the 
analysis applies to incoherent signal processing links in general and 
is not constrained, for example, by the number of components or IF 
frequencies that may be used. 

The receiver structure shown in Fig. 1 may be used for recovering 
the envelope of a transmitted sinusoidal signal or it may represent 
an optimum incoherent receiver for the detection of finite duration 
signals of known form in a background of Gaussian noise. The 
probability density function (pdf) of interest in the former case is 
that of the voltage appearing at the input to the video processing 
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Fig. 1— Model for an incoherent radar or communications link. 
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[point (Z) in Fig. 1] while in the latter case the pdf of concern is 
that of the voltage at the same point in the receiver at the decision 
time only. Hither of these cases gives rise to a Rician pdf. Differences 
between the two are reflected only in the definition of a suitable 
SNR. # 

The analysis presented here can be easily extended to determine 
optimum gain settings for an important class of nonlinear incoherent 
receivers; namely, those in which the nonlinearity can be represented 
by a memoryless nonlinear device acting on the envelope of the received 
signal. For example, logarithmic or lin-log receivers can be repre- 
sented by a logarithmic or lin-log characteristic inserted between 
points (D) and (£) regardless of whether the actual nonlinear device 
is a video or IF amplifier. One needs only to first refer the dynamic 
range and equivalent limiting voltages from the output of the non- 
linear characteristic [point (H)] to the corresponding values at the 
input to the characteristic [point (D)] and then to determine the 
gain setting by considering only the linear portion of the receiver. 
In what follows it will be assumed that this first step has been taken 
if necessary and only the linear incoherent receiver will be treated 
explicitly. 


II. DYNAMIC RANGE BOUNDS FOR MINIMUM EXCLUSION PROBABILITY WITH 
NONFLUCTUATING SNR 


For nonfluctuating SNR the voltage gain of the radar or communi- 
cations link is fixed. It is convenient to assume (without loss of 
generality) that the voltage gain y, of the propagation medium 
and/or target [i.e., the portion of the link from (A) to (B) in Fig. 
1] is unity. In the more general case of fluctuating SNR, the voltage 
gain of this portion of the link will be treated as a random quantity. 

The optimum placement of dynamic range bounds for incoherent 
receivers is determined by the pdf of the envelope detected signal, v, 
which appears at point (D) in Fig. 1. Let o be a normalization 
parameter and define 


R = normalized envelope of received signal = v/o 
a = lower normalized bound of dynamic range 

ad = upper normalized bound of dynamic range 

D = 20 log,. d = dynamic range in dB, 


I 


where these quantities are referred to point (D) in Fig. 1. If p, (R) 
denotes the pdf of the normalized envelope, the corresponding exclu- 
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sion probability is 


Ped 21> [ p,(R) dR. (1) 


To minimize P,(a, d) with respect to a for fixed dynamic range d, 
(1) is differentiated with respect to a and this derivative is set to zero. 
This yields the necessary optimization condition 


py(a) = dp, (ad) (2) 
which must be solved for a. Consider an optimum incoherent receiver 
for detection of signals of known form in Gaussian noise. Let 2c? be 
the mean square value of the signal voltage envelope, v, when only 
noise is present. In this case, the pdf of the normalized signal envelope 
is 


»(k) = R exp [—R’/2]. (3) 
The optimization condition (2) for this case leads to 
2 In (d) 
A= Fea 7 A A (4) 


in which A = a/ 1/2 is the optimum normalized lower dynamic range 
bound. When signal-plus-noise is present the probability density of 
the normalized envelope has a Rician distribution®* 


p,(R) = R exp [—(R* + y’)/21.4R) (5) 
in which 


I,(x) = modified Bessel function of first kind and order zero 
vy = voltage signal-to-noise ratio for y = 1. 


In this case condition (2) gives the following transcendental equation 
which must be solved for the optimum normalized lower dynamic 
range bound A = a/ oa 


=L+ ey ap Un LA dyV2) — in L4yV9)], (6) 
The minimum exclusion probability becomes 
P.(a, d) = 1 — Qy, AV2) + Qq, AdV‘2) (7) 


in which Q(a, 8) is the tabulated® Q-function defined by 


6G/p-= I ” g exp [-(@ + a2)/2I (at) dé. (8) 
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Solutions to (6) for various y and d are presented in Ref. 2 along with 
minimum exclusion probabilities for this case. The solutions can be 
obtained from Fig. 2 with y used in place of 7 and A in place of A. 


III. DYNAMIC RANGE BOUNDS FOR MINIMUM AVERAGE EXCLUSION COST 
WITH NONFLUCTUATING SNR 


In certain situations it may not be desirable to use dynamic range 
bounds which minimize the exclusion probability. It may be reason- 
able to favor saturation at one dynamic range bound to the other. 
In the case of a radar, for example, the signal is invisible if it falls 
beneath the lower dynamic range bound, while if the receiver satu- 
rates at the upper dynamic range bound the presence of the signal 
would be detected although its information content would be cor- 
rupted by the limiting. In such cases, a more reasonable criterion 
might be to minimize the average cost of excluding the signal from 
the receiver’s dynamic range. 

Suppose that when the signal falls below the lower bound a loss, 
C,, is incurred, while saturation at the upper bound causes a loss, co 
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Fig. 2— Optimum dynamic range centering for vas = 0. 
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(C1, Co > 0). The expected or average cost of excluding the signal 
from the dynamic range is 


L=a [| py(R) ak +o i: p,(R) aR. (9) 


It is convenient to divide (9) by c, to obtain the normalized 
average exclusion cost 


i= [r,an+> | pl) ar (10) 


in which » is the cost ratio, »y + c:/c,. It is seen that for » = 1 the 
normalized average exclusion cost, 1, reduces to the exclusion prob- 
ability. In order to minimize the average exclusion cost, the derivative 
of (10) with respect to a is set to zero. One then finds that the optimum 
lower normalized dynamic range bound a must satisfy 


p_(a) = v dp,(ad), (11) 


which reduces to (2) for v = 1 as it should. For the incoherent 
receiver substitution of (5) in (11) leads to the following transcen- 
dental equation for the optimum normalized lower dynamic range 
bound* 

1 


AP = Al+ Al + Gey lin L(A dy V2) — In L(AyV2)] (12) 


in which by definition 


In pv ; 
d—1 





Aj = (13) 


It is noted that it is entirely possible for c. to be less than c, making 
A? negative. However, the sum A? + A? is positive if vd” is greater 
than unity. Using (12) it is seen that for y = 0, i.e., Rayleigh distributed 
envelope, the optimum normalized lower bound can be determined 
explicitly from 


Arn = A?+ A>. (14) 


When the cost ratio, v, is unity (14) reduces to (4) as expected. It is 
convenient to measure the cost ratio in dB using 


Vaan = 20 logs Vv. (15) 


* The desired lower dynamic range bound is the positive real root of (12). 
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Thus, positive values of vag imply c. > ¢,, while negative values 
imply co < c,. vas = O is the case of minimum exclusion probability. 
If vas = —2D, then A? is 0 and A® = 0 solves (12) for any y. For the 
foregoing formulation to be meaningful, A” must be positive. Hence, (4), 
(12), (13), and (14) require that v4, > —2D. If this constraint is not 
satisfied, then the average exclusion cost, J, is not stationary with 
respect to A. For given y, d, and » it is generally necessary to solve (12) 
for A. This equation is of the general form x = f(x). A proposed scheme 
to find the solution is to iterate xz;.,; = {(v;) beginning with an approxi- 
mate solution x, . It can be shown that this scheme will converge if the 
magnitude of the derivative of the RHS, | f’(z) |, is less than unity in the 
neighborhood of the solution, x,. Moreover the convergence is faster 
as | f’(z,) | is closer to zero. In order to speed convergence an extrapolated 
iteration scheme can be used by introducing another parameter, 8. 
Consider the equation 


x = f(x) — Blx — f(a)]. (16) 


Provided 8 ~ —1 the solution to this equation is the same as that 
of « = f(x). If one could choose 


_ _}'G@s) , 
B a LS f'(x,) VG.) = 1 (17) 


the derivative of the RHS of (16) would be zero at x,. However, 
since xz, is not known at the outset the approximate solutions are 
substituted for x, in (17) to speed convergence. 

Using this approach (12) can be solved to any desired accuracy with 
the aid of the iteration formulas 


Aria = FP; —— BAA; = FP) B; ~ —I (18) 
Fes A Ar ei p ln LA, dyV2) — InI,(AwV2)]_ (19) 


1Vv2 | ylAsdyv2) _ iifderv2) | 
2° — NAL IYA;dyV2)— (Av V2) 
B=G/1—-G) G,#l (21) 
in which J,(x) denotes the modified Bessel function of the first kind 
and nth order. One may begin with small values of y, 7 = 0, A? given 
by (4) and A? given by (13). The iteration is stopped when | A;,, — A; | 
is less than the allowable error. Equation (12) was solved by this 
method for various values of vaz3 , D(dB), and y(dB). 
For d > 1 an approximate solution to (12) can be found explicitly. 


GS 





(20) 
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Neglecting the second term in brackets in comparison with the first 
and taking J,(a) = e* reduces (12) to a quadratic equation in Ad 
which has the solution Ad & (y/V2) + VW(7’/2) + Invd®. Thus, 
the optimum normalized upper bounds (as shown in Fig. 2 for » = 1) 
continue to increase slowly as D increases. 

The solutions obtained using (18) to (21) show that the optimum 
lower bounds for a wide range of cost ratios do not differ appreciably 
from those for van = 0. The difference (in dB) in optimum lower bounds 
for values of ran = +25 and rag = 0, and for rag = +50 and raz = O 
for various values of ¥ and d are shown in Fig. 3(a) and (b). (For non- 
fluctuating SNR take 7 in the figures as y and A as A.) It can be seen 
that for any given values of v and d, the largest difference is for 7 = 0. 
This maximum deviation can be determined explicitly. From (14) 


Az = Aj(1 + A?/A)). (22) 
I’quations (4), (13), and (15) then yield 
20 logio Ar — 20 logiy A, = 10 logiy (1 + van). (23) 
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Fig. 3.— Increase in optimum normalized dynamic range k bounds or in required 
receiver attenuation due to nonunity cost ratio. (a) vas = —25,25; (b) van = 
—50,50. 
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Thus, the maximum difference in optimum dynamic range bounds 
is determined by the ratio of the cost ratio in dB to the dynamic range 
in dB. A plot of (28) is shown in Fig. 4. 

The fact that the optimum bound is relatively insensitive to cost 
ratio at least for large D and large 7 is an important one since exact 
assessment of the costs c, and c, is difficult or impossible. However, 
this analysis shows that for large 7 and D an optimum solution for 
van = Ois nearly optimum for —D S vay S 2D. For ¥ = 0 the optimum 
dynamic range bounds are most sensitive to cost ratio but in this 
region differ only by about 3 dB from the optimum for » = 1. 

When the optimum normalized lower bound, A, is determined, the 
normalized minimum average cost of excluding the signal is given in 
this case by _ 

= 1- Qy, AV2) + QW, AdvV2). (24) 
These minimum average exclusion costs are shown in Fig. 5, in which 
the parameter ¥ is to be taken as y. For vag = 0, (24) becomes the 
minimum exclusion probability (7). 


Iv. A MODEL FOR TARGET FLUCTUATION AND FADING CHANNELS 


Thus far this paper has considered the case where the SNR at the 
receiver is fixed. However, in the case of radar the target cross section 


9720 LOGig Ao OR 


y= 


An (v,0,@)~An(1,0,7) IN DECIBELS 


20 LOGigA 





Fig. 4— Maximum change in optimum normalized dynamic range bounds or 
in required receiver attenuation due to nonunity cost ratio. 
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Fig. 5— Minimum average exclusion costs. 


presented to the receiver generally varies, while in communications 
links the phenomenon of fading generally causes fluctuation in the 
received SNR. Consider the case in which the SNR fluctuation is 
slow so that it is essentially fixed for the duration of a given signal 
but will fluctuate over longer time intervals. 

Following Turin® it is assumed that the medium from the trans- 
mitter to the receiver can be characterized as propagating two com- 
ponents, a fixed or specular component and a completely random or 
scatter component. Thus corresponding to a transmitted signal 
Re {s(é) exp (jw,t)} the reciever’s IF signal [point (C) in Fig. 1] with 
K = K,K.K, = 1is given by Re {x(t)} where 


a(t) = s(t — 7)[a exp (—76) + S exp (—j¢)] 
-exp (jw t) + n(t) exp (jw). (25) 


In (25) @ and & are fixed while S and » are independent variates; 
S having a Rayleigh pdf with mean square 2»? and g a uniformly 
distributed phase over an interval of 27. w, and w, denote the angular 
frequencies of the transmitted carrier and the receiver intermediate 
frequency, respectively. n(t) is a narrowband Gaussian noise process. 
It can be shown* * that the joint distribution of the resultant ampli- 
tude, y, and phase, @, of the sum of the fixed vector (a, 8) and the 
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random vector (S, ¢) is 


Ve exp E V+ — Zap cos (6 — a | 
2Qrpu Qu 


fo OS yp 05 6-—6S8 2r. (26) 


In (26) a can be regarded as the strength of the fixed component 
while x” is proportional to the strength of the scattered component. 
The quantity 6° = a’/y? is twice the ratio of the energy received via 
the specular component to that received via the scatter component. 
The variates y and 6 are, respectively, the instantaneous voltage gain 
and phase shift of the path from the transmitter to receiver and 6 
is the average phase shift of the path. Note that (26) is just the two 
dimensional Gaussian distribution in polar form. The pdf of y is found 
by integrating (26) over the range of @ giving®’® 


p(y) 2 eXD | - FE |e) for y20 (27) 








p(y, 0) = 


ll 


0 elsewhere. 
Letting r = W/p, (27) becomes* 


pa(r) = 7 exp [—(@" + 6°) /2]T.(6r). (28) 
The voltage gain of the propagation medium and/or target [from 
(A) to (B) in Fig. 1] is Y = pr. The model above is an adequate 
representation of propagation conditions which are encountered on 
ionospheric and tropospheric radio links.* The pdf (27) is sufficiently 
general since as 8 approaches zero (no specular component) (27) 
becomes the Rayleigh distribution with parameter » while if B ap- 
proaches infinity (presence of specular component only (27) may 
first be approximated by a Gaussian pdf of mean a and variance p? 
and in the limit by a delta function, 8(~—a@) corresponding to the 
case of no SNR fluctuation. Radar target fluctuations have been 
described by Rayleigh statistics’ a special case of the above model 
(8 = 0). In this case, p? is proportional to the average target cross- 
section. It is reasonable to expect that radar targets which can be 
modeled as a single large reflector plus a large number of independent 
scatterers will yield signal returns of the form (25). For this more 
general Rician fluctuating target p»?(1+?/2) is proportional to the 
average target cross-section. 


I 


* Note that (28) is a pdf of the same form as (5) as it must be since either 
equation is the probability density of the magnitude of the sum of a constant 
vector and a Gaussian vector. 
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V. GENERAL CASE: DYNAMIC RANGE BOUNDS FOR MINIMUM AVERAGE 
EXCLUSION COST WITH FLUCTUATING SNR 


The phenomena of radar target fluctuation and channel fading are 
evidenced by fluctuating SNR in the receiver. To account for these 
fluctuations, y in (10) must be weighted by a random voltage gain 
ur, where uw is a parameter and r is a random variable whose pdf #(r) 
determines the form of the SNR fluctuation. The normalized average 
exclusion cost can then be obtained from (10) giving 


I= | p(k) ak +» [ PQ) aR, (29) 


where 


Bux R) = [ Bol0)Peny(R) ar 0 


In (30), y is the voltage SNR at the receiver for unity channel gain, 
Le., for Y = ru = 1. The product ruy appearing in the integrand is 
the voltage SNR at the receiver for a particular realization of the 
random gain y = rp; that is, the “instantaneous” voltage SNR at the 
receiver. 

The condition for minimization of the average exclusion cost (29) 
becomes 


Duy(a) ae dpi, (ad). (31) 


Consider phase incoherent reception of signals in Gaussian noise with 
fading or target fluctuations described by the probability law (28), 
i.e., Ba(r) = pa(r). In this case the integral appearing in (30) becomes 


Pin(R) = [Re exp [- + 6/2110 


cexp [—(w'y’f + R°)/210.yRD) dt (32) 


which can be evaluated using an identity in Watson* giving 


: ee See E (R? + aay ( uBR ) 
Duy(R) a 1 4 by exp 2(1 + pry’) I, 1 Ae ey (33) 


To evaluate the average exclusion cost (29) one needs to integrate 
(82) or (83) with respect to R from some number 7 to o. This 
integral of (33) can be easily evaluated using the definition (8). 


* See Ref. 8, p. 395. Take Watson’s a = twyR, b = 78, » = 0} p? = (1 + p?y?)/2. 
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Performing the integration with respect to R in (32) then establishes 
the result} 


[pa ae = [ bexp [+ F/2U RD Qurt, 2) a | 
0 n) (34 





& a |) a ). 
V1 wy? V1 + 
Using (34) in (29) yields 


ns 0 His ——t=s5) 
Vite Vite 
uyB ad ) 

z AT; Py vite 
in which a is the optimum normalized lower dynamic range bound 
obtained as a solution to (31). By substituting (33) in (31) and 
manipulating the result it can be shown that the optimum a must 
satisfy 


a 1 
ge BO 2 ee tae ee 
a0 + ey) Veet Ga) 


. E) (a0) | 
in 1; zs wy In I 1 + by (36) 


Let 
Ae es (37) 
V2V1i + 
and 


t= e/a (38) 
Vite 

From (338) it can be seen that if 8 is zero the mean square value of 
R is 2(1 + p’y’). Thus A is the optimum lower dynamic range bound 
normalized to the rms voltage that would appear at the output of the 
envelope detector [point (D) in Fig. 1], if the specular component were 
zero. Since u8 = a the quantity 7 in (38) is twice the ratio of the rms 
voltage that would appear at (D) when only the noiseless specular com- 





+ The integral to the right of, the first equality in (34) would appear if the 
average cost for the nonfluctuating case is determined first as in (24) and then 
this cost is averaged over the random fluctuations of y. 
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ponent of (25) is present, to the rms voltage that would appear if only the 
scatter and noise components of (25) were present. The mean square 
voltage at the output of the envelope detector when specular, scatter, 
and noise components are present is 2o°[1 + py? + (a’y’)/2]. Using (37) 
and (38) in (86) gives 
A? = AP+ AD + eo [In 1,(7A dv/2) — n 1,(7A-V2)] (839) 
and (85) becomes 
l=1-Q4, AV2) +04, Adv2). (40) 


Equations (39) and (40) are of the same form as (12) and (24), re- 
spectively, with y replaced by 7 and A by A. These results show that 
the optimum dynamic range bounds and performance curves obtained 
previously for nonfluctuating SNR can be used directly for the more 
general case of fluctuating SNR by merely changing the variables 
via (37) and (88). Therefore, although there are two additional param- 
eters in the fluctuating case it is not necessary to increase the number 
of curves to describe performance. For va3 = 0 the criterion reduces 
to the minimum average exclusion probability as in the case of non- 
fluctuating SNR. 


VI. RECEIVER GAIN REQUIRED FOR THE GENERAL CASE 


The optimum gain or attenuation required for insertion in the 
signal processing chain at a point preceding the components which 
limit the dynamic range can now be calculated. Let c be the lowest 
voltage at which the signal processing chain can operate satisfactorily, 
referred to the output of the envelope detector.* Optimum dynamic 
range utilization requires that the signal be multiplied by a factor K 
such that the scaled lower normalized dynamic range bound is equal 
to the voltage c, when normalized to the same base. That is, 


Ka = c/o. (41) 
Substituting from (87) for a and transposing, (41) gives 
K(oV2/) V1 + wy = At (42) 


in which A is the solution to (39). Denote the LHS of (42) as K,, 
the normalized voltage gain, and let A, be the normalized required 
attenuation in dB. Then 
A, = —20 logis K, = 20 logy, A (48) 
* Point (D) in Fig. 1. 
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which expresses the normalized required attenuation in dB as a function 
of the optimum normalized lower dynamic range bound. Since A is 
the solution to (39) A, depends only on », 7, and d. It is fortunate 
that the normalized results can be expressed in terms of only a few 
parameters since this permits a concise description of optimum per- 
formance for many signal, noise and channel conditions. Optimum 
normalized attenuation required for the case of minimum exclusion 
probability (vag = 0) is shown plotted in Fig. 6. From (48) the actual 
required attenuation in dB can be obtained. Let A = —20 logy, K 
denote the actual required attenuation in dB. Then (42) and (48) yield 


A= A,(v, 7, d) + 20 logio (s V2/c) + 10 logy, (1 + n'y’) (44) 


in which the functional dependence of A, is shown explicitly. 

From (43) and (44) it can be seen that the difference in optimum 
receiver attenuations is the same as the difference in optimum normalized 
dynamic range bounds. Hence, Fig. 8(a) and (b) also show the differences 


Ay, v; d) = A,(1, y; d) (45) 


for values of vag = +25, +450, and various values of 7. For given 
v, ¥, and done can, therefore, determine A,(», 7, d) by finding A, (1, 7, d) 
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Fig. 6— Required attenuation for minimum exclusion probability. vas = 0. 
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in Fig. 6 and adding the difference (45) found in Tig. 3. Fig. 4 is a 
plot of the differences 


A,(v, 0, d) — A,(1, 0, d), (46) 


that is, a plot of (45) for zero SNR. For given v and d these differences 
are of the same sign as (45) but are always larger in magnitude. Hence, 
Fig. 4 shows the maximum change in optimum receiver attenuation 
due to a nonunity cost ratio. 

The definitions of the parameters appearing in (44) are summarized 
by the following list: 


vy = cost ratio 
o = noise power with no fluctuation 
y = voltage signal-to-noise ratio for unity propagation 
and/or target gain (i.e., Y = 1) 
uw = strength of scatter component of the propagation 
medium 
B° = twice the ratio of strength of specular component of 
the medium to that of the scatter component 
u (1 + 6’/2) = for the case of Rician fluctuating radar targets this 
quantity is proportional to the average target cross-sec- 
tion over all target fluctuations. 6 = 0 corresponds 
to the case of Rayleigh fluctuating targets 
= dynamic range of receiver 


= pyB/V1 + py’? 


= 9 rms voltage at (D) for noiseless specular component only . 
a rms voltage at (D) for scatter and noise components only 


2 2 


In the general analysis presented here, which includes fluctuating 
or nonfluctuating SNR and the criteria minimum average exclusion 
cost or minimum exclusion probability, special cases which may arise 
in various applications. are represented when the parameters take on 
particular values. Some special cases are shown in Table J. The entries 
in the table are for either criterion. 


TABLE I — CoNSTRAINTS ON PARAMETERS FOR SPECIAL CASES 





Type of fluctuation Type of envelope 
Constraints on parameters or fading detected signal 
p>d,8>0, y >0 Rician Rician 
up>0,8 =0, y >0 Rayleigh Rayleigh 
w—20,8 > ©,pB =a,y>0 none Rician 
uB =a,y = 0 none Rayleigh 
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It is noted that for u, 8, and y greater than zero one has the general 
case of Rician SNR fluctuation and Rician envelope detected signal. 

When £8 is zero the medium from transmitter to receiver does not 
propagate any specular component and the envelope of the received 
signal has a Rayleigh pdf independent of the other parameters. Setting 
6 to zero in (83) shows that the envelope of the received signal in this 
case has a mean square of 2c°(1 + y’y’). When y is zero only noise 
at the receiver is demodulated again giving rise to a Rayleigh distributed 
envelope but of mean square 2c” independent of the other param- 
eters. In each of these two cases (i.e., 8 = 0 andy = 0), the optimum 
normalized lower bound is found from (14), Ag = WA? + A”. Since 
the minimum average exclusion cost (40) depends on the value of A 
the minimum costs are equal for these two cases. However, it can be seen 
from (44) that the actual optimum lower bounds or actual required atten- 
uations for these cases differ. This is because the quantity, o7V1 + yy’, 
to which the received signal voltage envelope is normalized is different in 
these instances. Note that in the former case (6 = 0) the required 
receiver attenuation (44) is affected by the randomness of the scatter 
component while in the latter case (y = 0) it is not. This can also be seen 
from (25). When 8 is zero there is no specular component and the 
received signal (25) depends upon the scatter component while if y is 
zero the entire first term can be omitted and the received signal consists 
of only noise. 

When py goes to zero and @ approaches infinity such that y@ = a 
(a constant), the medium from transmitter to receiver propagates only 
a specular component with a voltage gain of a. In this case there is 
no SNR fluctuation (nonfluctuating case) and the envelope of the 
detected signal is Rician if y > 0 and Rayleigh if y = 0. Letting » = 0 
and w8 = a in (38) and (388) shows that for this case the SNR at the 
reciever Is wy, a result which is clear from (25) if the scatter component 
of the medium is deleted. There is no essential loss in generality in 
this case if a is taken as unity. With » = 0 and a = yB = 1 in (33) 
that equation reduces to the pdf considered in Sections II and III. 

For y > 0, » > 0, @ finite, the optimum gain settings for the fluctuating 
and nonfluctuating cases differ and the minimum average exclusion 
cost (probability) for the fluctuating case will be greater for the same 
values of y, d, v, and a. 


VII. EFFECT OF NOISE INTRODUCED BY THE GAIN ADJUSTMENT CASCADE 


In the foregoing discussion, the attenuation or gain required for 
optimum dynamic range utilization has been idealized as a multiplica- 
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tive parameter. These results apply when the noise introduced by the 
gain adjustment cascade itself is virtually independent of its gain. 
This condition is often realized in practice. When this condition is not 
satisfied some modification is necessary. The phenomenon can be 
represented by using an equivalent noise source at the point in the signal 
processing chain where the dynamic range calculations are being made. 
The quantities o and y used previously must be replaced by equivalent 
o, and y., respectively. Let ¥,(G) be the operating noise figure’ of the 
cascade when it is set for an available gain of G (dB). Then the equiv- 
alent noise power when the gain is G, is 


= 0 

47 
in which G, is the gain for which the SNR is y and the noise power 
is o°. The equivalent SNR is determined by 


se eG ns 
vig) = 71 FE (48) 
In the case where the noise depends upon the gain, K, (G = 20 log, K), 
both the quantities y and a in (29) or (85) depend upon gain. Hence, 
the optimization condition must be found by differentiating (29) or 
(85) with respect to K (or §) rather than a and setting that derivative 
to zero. However, this condition is generally too complicated to be 
useful and it is usually better to evaluate (29) or (35) for various § 
to determine the optimizing value. For the general case of the incoherent 
receiver the normalization for R in (29) is with respect to o, rather 
than o. In addition y, and o, must be used. Define 


a, = ¢/{Ko.()] _ 49) 
and 
Oe eee 
Then 
_ 2} F.(G.) |] 1 + wy’? |? 
coo - feo} . 





= e+ 19°82 Ea 
oV2 Vit py? F(G) J ° 


From (88), (47), and (48) one can find 


= (0) — —HBY(G) _ 
+O = Ty ei@r =) 
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In order to find the optimum receiver gain (47), (48), (51), and (52) 
are used in conjunction with 


l= 1 - Q4.G), V2 4.(6)] + Q17.(¢), dvV2 4.6)] (53) 


which must be minimized with respect to G. It is easiest to use a numer- 
ical method which requires only successive evaluation of (53) for various 
values of S, as opposed to methods requiring analytical evaluation of 
the derivative of (53). In the important case where only a finite number 
of gain settings G;, (¢ = 1, 2, --- N) are possible, minimization of 
(53) is easy requiring at most N evaluations for any given set of pa- 
rameters. Likewise when —/ is a unimodal function of § any of various 
search methods can be used.’° 


VIII. SUMMARY AND COMMENTS 

This paper considers the general problem of determining optimum 
receiver gains for radar and communications receivers. Dynamic range 
bounds and receiver gains are determined which yield minimum average 
cost of excluding fluctuating signals in noise. The analysis is general 
enough to include minimum exclusion probability as a special case 
as well as a range of fluctuation models including Rician, Rayleigh, 
and nonfluctuating cases. The analysis is applicable to both linear 
and nonlinear receivers and has important implications for certain 
radar processors and communications terminals which can use sophis- 
ticated techniques for signal and noise parameter estimation. The 
results are presented in a concise normalized form making them ap- 
plicable to a wide range of signal, noise, and channel conditions. It is 
shown that the optimum receiver gain is relatively insensitive to cost 
ratio for —D S van S 2D differing at most by about 3 dB from the 
optimum gain for y = 1. The effect of noise introduced by the gain 
adjustment cascade is discussed. 

The analysis presented assumes that certain signal, noise, and channel 
parameters are known to the receiver. In practice the receiver would 
be required to estimate these parameters. When these estimates are 
good, performance of the system will approach that described here. 
An extension of this work is to study both the optimization problem and 
the deterioration in performance when the parameters are not known to 
the receiver. Optimum dynamic range utilization for various coherent 
and partially coherent receivers can also be studied. 
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Floating-Point-Roundoff Accumulation 
in Digital-Filter Realizations 


By I. W. SANDBERG 
(Manuscript received June 20, 1967) 


In this paper, several results are presented concerning the effects of 
roundoff in the floating-point realization of a general discrete filter governed 
ideally by a stable difference equation of the form 


M N 

WV, = > bitn-k — a Oy Wnr-k » n2N (1) 
k=0 k=1 

in which {w,} and {x,} are output and input sequences, respectively. 

In particular, for a large class of filters 1t ts proved that there is a func- 
tion f{(K) with f(K) — 0 as K — © and a constant c, both dependent 
on the b;, the a,, the order in which the products on the right side of (1) 
are summed in the machine, and t, the number of bits allotted to the mantissa, 
such that 


()e S cy)x + f(K) 


for all K = N, in which, with {y,} the computed output sequence of the 
realized filter, 


Qk = (A= > | Yn ) 


and 





x = bet j 3 | Wn — Yn Be 


Bounds on f(K) and c are given that are not difficult to evaluate, and which, 
in many realistic cases, are informative. For example, for the second-order 
bandpass filter: 

Wy = Lq — AyWa-1 — AWa-2 , n2=2 (2) 
with a, and az chosen so that its poles are at approximately + 45° and 
at distance approximately (but not less than) 0.001 from the unit ctrcle, 

1775 
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we find that c, an upper bound on the “asymptotic output error-to-signal 
ratio’’, is not greater than 0.58 X 107*, assuming that t = 27, that the 
terms on the right side of (2) are summed in the machine in the order in- 
dicated (from right to left), and that the x, in (2) are machine numbers. 
If the x, are not machine numbers, and hence must be quantized before 
processing, then ec S 0.76 X 107%. . 

In addition to error bounds, an inequality is derived which, tf satisfied, 
rules out certain types of generally undesirable behavior such as self- 
sustained output limit cycles due to roundoff effects. This inequality 
ts satisfied for the example described above. 


I. INTRODUCTION 


The difference equation 
M N 
UW, = > DO Bijae c= >> AxpWn-k ; n = N (1) 
k=0 k=1 


with M s N defines the behavior of a general time-invariant discrete 
filter which acts on an input sequence 2, %,, %, °°: to produce an 
output sequence Wy, Wy11, Wy+2, °°: that depends on the starting 
values Wy), Wi, °°* , Wy-1- 

There is a vast literature concerned with techniques for designing 
discrete filters [i.e., for determining the a; and the b; in (1)] to meet 
specifications of various types (see, for example, Refs. 1, 2, and 3), and 
a good deal of material is available on the subject of roundoff effects 
in fixed-point realizations of discrete filters (see, for instance, Refs. 4 
and 5). In this paper, we derive some bounds on a meaningful measure 
of the overall effect of roundoff errors for discrete filters realized as 
digital filters on a machine employing floating-point arithmetic oper- 
ations. This type of realization, as opposed to the fixed-point kind, is 
of particular importance in connection with, for example, digital com- 
puter simulations of systems, as a result of the large dynamic range 
afforded by the floating-point mode. 

There are basic differences concerning fixed-point and floating-point 
error estimation problems which stem from the fact that the modulus 
of every individual arithmetic error in the fixed-point mode is bounded 
by a constant determined by the machine, whereas the maximum 
modulus of the error in forming, for example, the floating-point sum 
of two floating-point numbers is proportional to the magnitude of the 
true sum. For this reason, the approach* presented here, as well as the 
"* The approach can be extended in several different directions. For example, 


it can be used to obtain statistical error estimates based on the assumption that 
each roundoff error is an independent random variable. 
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character of the results, are quite different from those of earlier 
writers concerned with fixed-point realizations. 

In addition to error bounds, an inequality is derived which, if 
satisfied, rules out certain types of generally undesirable behavior 
such as self-sustained output limit cycles due to roundoff effects. 


II, ASSUMPTIONS AND RESULTS 


2.1 Assumptions 
It is assumed that: 


(¢) each machine number gq is equal to sgn (q) a 2” in which 
the exponent b is an integer, and a, the mantissa, is a t-bit number 
contained in [3, 1] or [3, 1] U {0}; 

(it) the range of values of b is adequate to ensure that all computed 
numbers lie within the permissible range; 

(2iz) the machine operations of addition and multiplication are per- 
formed in accordance with standard rounding conventions* (described, 
for example, by Wilkinson®); and 

(iv) the coefficients a, and 6, in (1) are machine numbers.t 


2.2 Results: x, Machine Numbers 


It is assumed throughout Section 2.2 that the x, of (1) are floating- 
point machine numbers. 
If the discrete filter (1) is realized on a floating-point machine, then 


M N 
Yn = fl > Cea p oe aunt) , n2ZwN (2) 

k=0 k=1 
in which the y, are approximations to the infinite precision numbers 
w,, and fl(2 — 2) denotes the machine number corresponding to 


(2 — ZY) with the understanding that the floating-point numbers cor- 
responding to the products b,%,_, and azYn_, are to be machine-added 


in some specified order. 
Let 


D@ +1+ > ne (3) 


* That is, conventions for which the first two equations of Section III are 
satisfied. 

y It is certainly true that preliminary design considerations may lead to coeffi- 
cients that are not machinc numbers, and one may then be interested also in 
the overall effect of approximating the coefficients by machine numbers. That 
problem also can be treated with the approach used here. 
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let 





(Ox = (4 Ya r) 


+1; 
for every sequence {q,} and all K = 0, and let e, denote the nth error 
(y, — w,) forn = 0. 


Our first result (all proofs are given in Section III) is as follows. 
If D(z) ¥ O for | z| = 1 [i.e., if the discrete filter (1) is stable], then 


tw\-l 1 — 2 ; 
Ons mex [DO (TE Ia) 





=—t nad ta\—l1l 1 = ) 
ne (25 11 8) max (De) (gis Ys! 


N 
“(3 Jaan) max | De | (hs a) 
k=1 Osws2r 
for all K = N, in which, with y, = w, = 0 forn < 0, 
N 
m= Dy (Yaz — Wns) n=0,1,2,---,W-—-1) 


and the a, and #, are easily evaluated nonnegative numbers which 
depend on the order in which the products in (2) are summed. 

Since the first term on the right side of (4), which arises as a result 
of the possibility of differences in the starting values, approaches zero 
as K — o, we see that, after a reasonable number of evaluations of 
the successive y, , (€)< is bounded essentially by a constant times the 
root-mean-squared value of the input sequence, plus another constant 
times the root-mean-squared value of the output sequence. 

In order to determine the a, and 8,, we draw a signal-flow graph 
that indicates the ordering of the operations that would be used to 
compute 


M N 
f (> Ohi >»; aut) (5) 
k=0 pel 


if x, and y, were unity for all n. This graph is to contain an input 
node with input bj for each b, ~ 0, an input node with input aj for 
each a, ~ 0, no other input nodes, and a single output node 6 which 
is associated with 


M N 
yh Sal. 
k=0 k=1 
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All other nodes represent an addition or subtraction of two signals 
to produce a third signal. Exactly one branch is connected to each 
of the input nodes and to the output node. We assign the value p to 
all of the branch transmissions with the exception of those branches, 
if any, which terminate on an input b/ or af for which b; or a, , respec- 
tively, is equal to unity. These branches are assigned unity transmission. 
Then, by inspection, we evaluate the signal at 6, which must clearly 
be of the form 


M N 
b/ ge p(k) + a’ ga (k) 6 
yy kp 2m iP (6) 


in which g,(k) and ¢,(k) are positive-integer valued functions. In 
terms of these functions* 


B, = (1.06)g5(k) 
a, = (1.06)¢,(k). 


For example, if the right side of (2) is computed as the floating-point 
difference of the machine sums 


fl(box, + Byty-1 Fees + by2n— a1) 
and 
fl(aiyn-1 or ag) Pe sc AnYn-N) » 


each obtained by performing machine summations in the order indicated 
(from left to right), if all of the 6, and a, are nonzero and not unity, 
and if I = 1 and N 2 2, then the relevant flow graph is shown in 
Fig. 1, from which it follows that 


Bo = (1.06)(Z + 2) 
1 = (1.06)(7 + 2) 
6, = (1.06)(8 + M — k); k 
a, = (1.06)(NV + 1) 
2 = (1.06)(N + 1) 
a, = (1.06)(8 + N — k); k = 3,4,--- ,N. 


The bound (4), although revealing, requires a knowledge of both 
(t)x and (y)x and is, therefore, not as explicit as we would like. 


R 


I 


Diy Oy Og ML. 


R 


* We are assuming here only that max |Jpe(k) 2? < 0.1 and max |pa(k) |2-? 


< 0.1. Also if pg(k) = 1, then we can take 6; = 1, and similarly for pa(k). 
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bo a 





Fig. 1— Flow graph for the example. 


For the important case in which b) ¥ 0 and N(z) = >o™, bie * ¥ 0 
for |z| = 1 (ie., for the minimum-phase filter case) we prove that 


if the filter (1) is stable and if 


M 
min | N@‘’) | > 27 > |b. |B. 


OSws2r 


(7) 


then there exists a constant c, independent of K, and a function f(K) 


with the property that f(K) — 0 as K — © such that 


(dx S cly)x + {U) 
for all K = N. Moreover, it is proved that 


Osws2T 


e<27' max er Bian Dy | be | Be 


max | Dee’*)/NCe’**) | + max | N(e*’)7* | 27° do | a eal 





1 — 2+ Do | by | By max | Ne“) | 
and 
to\-l ] = 2 ; tw\—-1 
fUX) S max | De’*) (Gag Ela!) + max | De ) 


(8) 


(9) 


| 27° 
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1 
2 





’ = 1 N-1 
¥ max |e" |(ZESE | a) 
I 1 =f) 
(62 | bi | Bx epee Sat Sees (10) 
ms 1-2°> |b. |B, max | N(e**)7? | 


for all kK 2 N, in which, with ag = landz, = y, = 0 forn < 0, 
N M 
Qn = o> QnYn-k “< > peers 
k=0 k=0 


forn = 0,1, 2,---,(N — 1). 

Since (y)x is the root-mean-squared value of the computed output, 
and since f(K) — 0 fairly rapidly as K — ©, we may interpret the 
smallest value of c for which (8) is satisfied (for all input sequences) 
as an “output error-to-signal ratio” of the realized digital filter. Note 
that the bound (9) on ¢ is not difficult to evaluate. 


2.2.1. Stability in the Presence of Roundoff 


If roundoff effects are ignored, it is well known that the discrete 
filter is stable in several different senses of the word if D(z) ¥ 0 for 
|z| = 1. In Section III it is proved that, with roundoff effects taken 
into account, the digital filter is stable in the sense that there is a 
constant c, and a function f,(K), with f,() independent of the values 
of x, for n 2 N and f,(K) > 0as K —> &, such that 


(yx S a(x)x + f,(K) (11) 
for all K = N, provided that D(z) # 0 for |z| 2 1, and 


N 
min | De‘*) | > 27 DD | a, | oy (12) 


Roughly speaking, inequality (12) is satisfied if the damping of the 
infinite precision counterpart of the digital filter is sufficiently large 
relative to the number of bits allotted to the mantissa. Stability in 
the sense of (11) rules out, for example, the possibility, due to roundoff 
effects, of a limit-cycle response to a zero input sequence or to an 
input sequence {,} that approaches zero as n > 0©.* 


* There are simple examples which illustrate that instability may result with 
D(z) ¥ 0 for |z| = 1 if (12) is not satisfied. For instance, suppose that each machine 
number is represented in the form (—272° + mj27) + m27? -F +++ -F m27*)2? 
with the m; zeros or ones, and t > 1. Let 

Wn = (1 — 2-¢)waa + (1 — 27*)2-'w,_2 for n = 2, with w = wi = — 1. 
Then f[(1 — 2-!wi) = —(1 — 2>*), fll — 274)2-two] = —(1 — 27*)274, and 
fl[—Q — 2~*) — (1 — 27-*)2-4] = —1, which shows that the computed approximation 
Yn to Wr Satisfies yn = —1 for alln = 0. This example is a slight modification of one 
suggested by S. Darlington. 
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2.3 A Result Concerning the Overall Effect of Input Quantization Errors 


In many applications the sequence {x,} of (1) is obtained by quantiz- 
ing an input sequence {,} [i.e., by replacing each #, with the machine 
number (or one of the possibly two machine numbers) of closest value]. 
The infinite precision response wy, Wy+,, °°: to the sequence {zy} 
satisfies 


UW, = 


Ms 


N 
b.En—% — > AyWr-k 7 n= N (13) 
k=l 


tow 
il 


te) 


with wW,., %,, --* , ®Wy_, some set of starting values. Let wy , Wy41,°°° 
be defined by (1) with w, = w, for n = 0, 1, 2, --- , (NV — 1). It is 
clear that (y — w)x, the root-mean-squared value of the difference 
of the computed output and the infinite precision response to {Z,}, 
satisfies 


y — Oe Sy — w)e + (w— W)x. (14) 


Bounds on the first term on the right side of (14) are given in Sec- — 
tion 2.2. In Section III it is proved that if both N(z) and D(z) have 
no zeros on or outside the unit circle, b) ¥ 0, and 


M 
min | N(e"*) | > 27° 2, | be |B, 
then* there is a constant c. and a function f,(K) such that f.(K) — 0 


as K — o, and 


(w — @)x S ely)x + f2(K) (15) 
for all K 2 N. It is proved also that 


M 
Co < 27 » | b, | max | D(e**)7? | 
k=0 co) 


max | D(e’*)/N¢e'*) | + max | N(@*)™* | 27 x | a, | ax 
— Af : a . (16) 
1—27* D7 |b, | 6, max | Ne’) | 


2.4 A Realistic Example 
For the ideally stable second-order bandpass filter 
Wy = Lp — AyWa-1 — A2Wp-2 , n2z2 


*It is assumed here that the range of values assigned to the mantissa includes 
the number zero. 
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with poles in the z-plane at angles % +45° and at distance ® 0.001 
(but not less than 0.001) from the unit circle, we have a, Y —1.41, 
a, © 1, and min, | D(e’*)~*| = (0.00141)7*. We assume that the 
operations are performed as indicated in Fig. 2, so that B) = 1, a, = 
3(1.06), and a, = 3(1.06). Assuming that ¢ = 27, we find that ¢ our 
bound on the “asymptotic output error-to-signal ratio,’ ignoring 
input quantization effects, is approximately 0.584 xX 10™*. For this 
problem, our bound on ¢ is approximately 0.18  107*. Thus, even 
taking into account input quantization effects, the error-to-signal 
ratio is not more than 0.764 X 10°*. Finally, a simple calculation 
shows that this filter is stable in the presence of roundoff, in the sense 
of inequality (11). 


Ili. PROOFS 


3.1 Derivation of Inequality (4) 


If a and Bb are floating-point machine numbers, then the floating- 
point product and sum fl(ab) and fl(a + b), respectively, satisfy° 


fl(ab) = ab(1 + ¢) 
fi(a + b) = (a+ &)(1 + 8) 


with | «| S 2°‘ and | 6| S 27’. Thus, 
M N 
10> beter — Ds autos) 
k=0 k=1 


is equal to the value of the output signal 6 of the flow graph described 
in Section II with 
(4) bf = Detar 


QL = UYn—k 





Fig. 2— Flow graph for the second-order band-pass filter. 
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and 


(iz) each of the branch transmissions of the form: (1 + e) with 
|e«| S 2°‘ (recall that in certain special cases ¢ is taken to be 
zero), or —(1 + ¢) with | «| < 2°‘. Therefore, 


M N 
fi( Otek — oD autor) 
k=0 k=1 


is equal to 
M N 
>» bp Vn—K Ve oe > QxYn—Kl 
k=0 k=1 
in which 
deze eta ye” (17) 
and 
CT a Dee See a) (18) 


Inequalities (17) and (18) imply° 
1 ~ (1.06)gs(k)27' S q, S 1 + (1.06) gp (k)2° 
1 — (1.06)o.(K)2°' Sr, S$ 1+ (1.06)¢.(k)2™ 


lA 
lA 


provided that 2~* max; gs(k) < 0.1 and 27’ max, g,(k) < 0.1. 
Thus, forn = N 


M N 
Yn = fl bptn—p 7 De tutn-t) 
k=0 k=1 


(19) 
M N 

— De bitcs =>. oF OnYn-k + Qn 
k=0 k=l 


with 
M 


S28 DE | de || tune [Bef 2 De | ae || dane | oe (20) 


k=0 


| 


and 
B. = (1.06) gg(h), a, = (1.06)¢.(k’. 


Using (1) and (19), 


N 
ay GyCn-k = Nn s nr = 0 
k=0 
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in which, with y, = w, = 0 forn < 0, 


N 


Nn = BS A(Yn-k —* Wn) 


k=0 
for n = 0,1, --- , (N — 1). By Propositions 1 and 2 (see Sections 3.5 
and 3.6) 
(eye S max | De’) | (nx , Kk 20. (21) 
OsSws2r 


By Proposition 3 (Section 3.7), inequality (20), and Muinkowski’s 
inequality 


1 N-1 a‘) 
4 s I 11 n 
M N 


-! LS aa ai 
+ 2 Yi ladgiy taf) +2 Dela, | oxtyx (22) 


k=0 k=1 


for all K = N. This proves inequality (4). 


3.2 Inequality (8) 


Here we assume that both D(z) and N(z) are zero free for | z| 2 1, 
that bo # 0, and that 
ML 
mom, |e) OS | by Bik (23) 
OswsS2r k=0 


From (19), we have, with ay = 1, 


N M 
> AY n-k = os Dink + An ’ n = 0, (24) 
k=0 k=0 
where 
Qn = Mn n2wN 
N M 
= DY ane — D> Dink ; n=0,1,2,-°-,W-—1) 
k=0 k=0 


with x, = y, = 0 for n < 0. Therefore, by Propositions 1 and 2, 
(x) = max | D@e'*)/N(e"*) | yx + max | NE)" | Qe, K 2 0. 
(25) 
Using Proposition 3, Minkowski’s inequality, and (20), 


1 N-1 : i 2a M 
es Gy Slat) ete Do | be | Bale) 


+2° SM lalaty., KEN. (26) 


k=1 
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Therefore, 
. : * N 
max | D(e**)/N(e*) | + max | Ne’)? | 27! DS | a | 
w @ 


()6 > Ue 
1-2 D7 |b | 6, max | NG)" | 
=0 w 


IIA 


1 
z 


me vey" (RE lat) 





M (27) 
1—2° DS b, | 8, max | Nee)" | 


for all K = N, which together with (21) and (22) yields 


IIA 


(e)« 


Osws2r k=1 


2° max | De’) | i | a, |o, + pa | b, | B: 


max | D¢e’’)/N(**) | + max | N(e**)™* | 2° x | ay | «| 
ooo (y)xc 
1-2 Sle famaxiver| | 


N-1 } 
+ max | De" | (Gy S| mf) + max | De | 2 


K ae 1 n=0 
we) (LS oP) 
ae max | N(e**)7" ( = In ) 
K+if 
ee | bi | Bx Dee (28) 
Pe 1—2°* >> | bd, | 6, max | Ne’)? | 


This proves that there exists a constant c and a function f(K) with 
the property that {/(K) — 0 as K — © such that (8) is satisfied for all 
K = N, and of course it also proves that c and f(K) are bounded as 
stated in Section 2.2. 


3.3 Proof of (11) Under the Conditions Stated 
From (24) and Propositions 1 and 2, 


(ude $ max | N(e'*)/D(e*) | (a)x + max | De)" [Ka)e , 


and using (26) 
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max | N(e'*)/D(e'*) | + max | D@**)"' | 2°' Do | by | Be 
Vea = 7 = (x) 
1—2°' }* | a |e, max | De’) | 


k=1 





1 


max | De*y" (goa & 1a) 


N 
1— 27 >) | a | a, max | Dée**)”* | 
k=1 3) ; 








+ 


which completes the proof. 


3.4 Derivation of Inequalities (15) and (16) 
We have, from (1) and (18), 


N 


Ay, (Wx .- Wnt) i En ’ n= 0 (29) 


k=0 


cs . b 
in which a, = 1, 


IIV 
= 


M 
é, = De Uy (ta-- a Vici) n 


k=0 
and 
£, = 0, m=O, 92,78 WW = ys 


Since , = sgn (€,)h2° for some integer b and some h ¢ [3, 1] (assuming 
that «, + 0), the magnitude of the error in approximating <, by the 
closest machine number x, = sgn (€,)a2” is at most $27'2” = 327‘a™* 
| 2, | S$ 2°‘ | 2, |. Therefore, forn = N 


M 


ey 2"" >, | be [2 Bee [ 


k=0 


and by Propositions 1, 2, and 3 


(w — W)x < max | D(e’*)7* | 27° 


|, KN. (0) 


M 1 K , 
Zlallegy Zz lel). eR 


aN— 


From (80) and (27) 


M 
(w — we S2°* Do |b | max | Dee’) | 
k=0 a 
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N 
max | D(e’’)/N(e'’) | + max | N(e"’)"* | 27 2 | az | a 











= (YK 
1-—2° = | by | 6, max | Ne’) | 


1 


Mf 
at S|, | max | De)? | max | Me" | (Fes D lal) 








-{- 





M 
1—2°' >° | bz | B, max | N(e"*)™* | 
k=0 w 
for all K = N, provided that N(z) ¥ 0 for | z| = 1, bo ¥ 0, and 
: AL 
min | N@’) | > 27 S| be | Be - 
rm k=0 
This completes the derivation. 


3.5 Proposition 1: 
If 


L L’ 
>> Cia = >» d1Sn—-1 + 1 ' n2z0 
1=0 1=0 


with: 7, = s, = 0 forn < 0, c& ¥ 0, and >. c27' ¥ Ofor|z| =], 
then 


IV 
o 


n n 
=— oD Un-1S8k + 3S Un-ebe ; n 
k=0 k=0 


in which 


ce} 0 
Do tte eey, Dope ae 
n=0 n=0 
co ; L’ ; L ; 
> ue = ay y cet’, 
n=0 t=0 7=0 


and 


oo L 
nem = 1 / Soe 
n=0 t=0 


forO0 Sw S 2z. 
Proof :* 
*The proof of this result, although rather trivial, is included because the 


writer knows of no reference where it is proved without the assumption that the 
sequences {s,} and {fn} are z-transformable. : 
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Let AZ > 0, and let 


§ =s, for ns M 
=Q0 for n>WM 
i =f, for ns M 


= 0 for n>dJ. 


Then r, = #7, forn S M, with 


L L' 
a CiPn-1 = a dSn-1 + 1s ’ n = 0 
t=0 t=0 


and with {#,}, {s,}, and {f,,} z-transformable. Therefore, we have 
Re) = “ke os C2 8 S@ + 3 cet) 1X6) 
1=0 
in which 


R® = Mae" 
n=0 
* M 
S@® = ds2" 
n=0 
- AML 
FE — dX fe”. 
Thus, 
= Dota + Doerr, 120 
k=0 k=0 
and hence 
= Dis, + QD meade » (31) 
k=0 k=0 


forn = 0,1 - , AT. However, since JZ is arbitrary, (31) is satisfied 
for all n = 0. This proves Proposition 1. 


3.6 Proposition 2: 


If 
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which proves Proposition 2. 


3.7 Proposition 3: 
If 
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This completes the proof. 
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An Approach to a Unified 
Theory of Automata 


By J. E. HOPCROFT* AND J. D. ULLMAN 


(Manuscript received April 7, 1967) 


A model of an automaton, called a balloon automaton is proposed, It 
consists of a finite control, which may be deterministic or nondeterministic, 
an input tape which may be one way or two way, and an abstract, infinite 
memory, called the balloon, which can enter any of a countable number 
of states. There is assumed to be a recursive function which manipulates 
the state of the balloon, and another which passes a finite amount of in- 
formation from the balloon to the finite control. 

A subset of the balloon automata is considered a closed class tf it obeys 
two very simple closure properties. Certain closed classes recognize exactly 
the languages recognized by such familiar automata as the pushdown 
automaton or stack automaton. Unfortunately, no closed class recognizes 
the sets accepted by linear bounded automata or the time and tape com- 
plexity classes of Turing machines. 

It is shown that many of the usual closure properties of languages 
accepted by the pushdown automaton, stack automaton, etc., hold for an 
arbitrary closed class of balloon automata. For example, the languages 
accepted by a closed class of one-way, nondeterministic balloon automata 
are closed under concatenation. Of special interest is the fact that a closed 
class of two-way deterministic balloon automata is closed under inverse 
g.s.m. mappings. This fact 1s not obvious, and was not known for all 
of the types of automata which form closed classes of balloon automata. 

It should be emphasized that the purpose of this paper ts not to propose 
another ‘‘model of a computer.”’ Rather, we are proposing a method of 
proving the standard theorems about existing and future models. Hope- 
fully, when a model is proposed in the future, one will simply show tt 
equivalent to a closed class of balloon automata, and have many of the 
closure properties automatically proven. 


* Currently at Cornell University, Ithaca, N. Y. 
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I. INTRODUCTION 


In the past, and especially recently, people have been examining 
various species of automata, perhaps as models of the compiling and 
translating processes, or for the insights they lend to computation. A 
partial list includes the Turing machine,? pushdown automaton,” * * 
deterministic pushdown automaton,’ counter machine,®* stack auto- 
mation, in all its forms, two-way,® one-way,” 1° 11 nonerasing,?? de- 
terministic and nondeterministic, the nested stack automaton,’* and 
the time’* > and tape?® 1748 bounded Turing machines. This list is 
not meant to be a complete survey of past writings, and more can 
be expected in the future. 

Many of the properties of each of the automaton classes mentioned 
are the same. For example, one would expect the set of languages 
accepted by each class to be closed under intersection with a regular 
set. Our plan is to propose a model of an automaton abstracting the 
common features of most of the models mentioned. We will define a 
class of automata to be a subset of the set of all such automata if 
it satisfies certain simple and physically meaningful closure properties. 
Then, from these closure properties, we will derive many of the 
common closure theorems which have been proven for the specific 
types of automata mentioned, and which, presumably, would be 
proven for future types. 

The basic model is shown in Fig. 1. It consists of a two-way input 
tape, with end markers, a finite control, and an infinite storage of 
unspecified structure, called the balloon. 

We assume that the states of the balloon are represented by the 
positive integers. A move of the automaton is a three-stage process. 
First, a recursive function is used to get a finite amount of informa- 






INPUT HEAD 


FINITE 
CONTROL. 


Fig. 1— Balloon automaton. 
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tion from the balloon. Typically, this information is analogous to the 
symbol scanned by the storage head of an automaton with a tape 
memory. Second, based on the information from the balloon, the state 
of the finite control, and the symbol scanned by the input head, a new 
state of finite control and a direction of input head motion is de- 
termined. Third, based on the new state of finite control, and the 
current state of the balloon, a recursive function determines the next 
state of the balloon. Certain states of the finite control are final 
states. If the input causes the automaton to enter a final state, the 
input is accepted. 

A subset of the set of balloon automata is called a closed class, or 
simply a class, if: 


(1) It contains the finite automata. 

(1) If two automata are in the class, a third in the class can be 
found by associating in any way, the recursive functions getting 
information from the balloon and determining the next state of the 
balloon. 


The latter condition is vague, but will be made formal. 

Most, but not all, of the types of automata mentioned can be 
interpreted as classes under our definition. It seems that a type of 
automaton is a class if its definition involves only the ways in which 
the infinite storage may be locally manipulated. Sets such as the time 
and tape complexity classes of Turing machines do not form classes. 
With special emphasis, the linear-bounded automata unfortuately do 
not form a class in our formulation. Note that single changes in the 
next state of finite control function for a Turing machine may cause 
it to use much more time or tape than did the original machine, so 
condition (72) would not be satisfied. Some of the automata, all two- 
way deterministic, which do form classes are: 


(1) Pushdown automaton. 

(71) Stack automaton. 

(21) Nonerasing stack automaton. 
(2v) Nested Stack automaton. 
(v) Single counter machine. 

(vt) Finite automaton. 

(vw) Turing machine. 


Our model shall be modified to treat nondeterministic and one-way 
input devices later in the paper. We have chosen two-way determinis- 
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tic devices to treat first because, with one exception, the theorems 
involved are quite straightforward. 


II, THE TWO-WAY DETERMINISTIC BALLOON AUTOMATON 


A balloon automaton consists of: 


(z) A finite, nonempty set of states, S. 

(iz) A finite set of input symbols, I, which includes ¢ and $, the left 
and right end-markers of the input, respectively. 

(az) A set of balloon states, which is always the positive integers, 
denoted by Z. 

(iv) A finite, nonempty set of integers, J7, known as the balloon 
information. 

(v) A total recursive function, h, from Z to JZ, known as the balloon 
information function. 

(vt) A function g, with finite domain, S X I X M and finite range 
SX {-1, 0, +1}. We will also allow ¢, the null set, in the range of g. 
We call g the finite control function. 

(viz) A partial recursive function, f, from S X Z to Z, known as 
the balloon control function. 

(viii) A subset, F’, of S, called the final states. 

(ix) A state go in S, the start state. To simplify matters later, we will 
here assume that the start state is not a final state. The balloon automaton 
is denoted (S, J, AZ, f, 9, h, qo, F). 


We denote a configuration of the automaton A = (S, J, M, f, g, h, qo, L) 
by (q, w, j, 7), where: 


(z) gis a state of the finite control, in S. 

(iz) w is in I*. More specifically, w = ¢a,a, --- a,$, n = 0, where 
forl <k Sn,a,isinJ — {¢, $}. Thus, ¢ marks the left end and $ 
the right end. We call n the length of w. Endmarkers do not contribute 
to the length. 

(c2t) 7 is an integer between 0 and n + 1, denoting the position of 
the enput head of A. 

(iv) tis a positive integer, dies: state of the balloon. 


As previously mentioned, a move of A is a three-stage process. Let 
(q., W, ji, %1) be a configuration of A, and the j,th symbol of w be a. 
Let w, exclusive of endmarkers, consist of n symbols. We call ¢ the 
Oth symbol, $ the n + Ist, and number the non-endmarker symbols 
from 1 to n from the left. Suppose h(7,) = m. Then, find g(q,, a, m). 
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If it is y, no move is possible. Suppose g(q,, a, m) = (g2, d), where 
gz isin S andd = —1, 0, or +1. Then, compute, if possible, f(q2, 21). 
Let it be 7, . If 7, = 7, + d lies in the range 0 to n + 1, we say that a 
move is possible, and the next configuration is (q2, Ww, jo, %2). 

Note that f(q2, 7,) does not necessarily have a value. In that case, 
there is no move possible. 

Intuitively, to make a move of A, we get what information we can 
from the balloon by calculating A(z,). Then, using g, we find the new 
state of finite control and direction of motion of the input head. Finally, 
using f, with the new state of finite control, we find the new balloon 
state. 

If, from configuration (q,, w, ji, 7), the next configuration of A is 
(qo, W, jo 12), We Say: (qi, W; fry 21) [a (Gey W; Jo, 22). If A can go from 
configuration (q,, W, j: , 21) to configuration (q2, W, jo, 22) by some 
number of moves, including zero moves, we say: (q: , W, ji, %1) [4 (@, UW; 
Je ’ 12). 

Notation: We will, for a balloon control function f and state q 
in S, often use f,(z) for f(q, 7). Also define a” to be the function from 
Z to Z such that a‘ (¢) = 7 for all 7. Let a‘”, for integer j = 1, be 
the function that takes 7 to 7 for all 7 in Z. . 

If A = (S, J, M, f, 9g, h, qo, F) is a balloon automaton, let the tapes 
accepted by A, denoted T(A), be the set of w such that 


* . . 
(Go » W, 0, 1) la (q, W, vi 1) 


for some q in F, input head position, j, and balloon state, 7. That is, 
starting in the start state with the input head at the left endmarker 
and the balloon in state 1, w must cause A to enter an accepting state. 

Note that if g determines, in some configuration, that A enters 
state p, and p is an accepting state, but for the state of the balloon, 
1, f,(2) is not defined, then A has no next move, hence does not accept. 

Let C be a subset of the set of all balloon automata. We say C' is 
a Closed class, hereafter shortened to class, if it satisfies the following 
two conditions: 


I. (S, 7, MW, f, 9, 2, do , ) isin C for any finite sets, S, 7, F C S,qin 8, 
and arbitrary mapping g from S X I X M to (S X {—1,0, +13) U {¢}. 
We restrict h to be a‘” for some j = 1 and M = {j}. Also, for each 
qin S, f, is a” for some k = 0. 

TT. Let (S101, Mi,f:,91:,41,%,5 Fs) and (S2, 12, M2, fo, g2, he, 
go, Fo) be in C. Then (S3, Js, Ms, fs, 93, hs, Qs, Fs) isin C if; 
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(¢) S3 and J; are arbitrary finite sets. 
(71) Ms is the range of hz. 

(277) g3 is in S3. 

(tv) F, CS De 7 

(v) g3 1s an arbitrary mapping from 


Ss X I; XM; to (8S; X {-1,0, +1}) U {e¢}. 


(vt) For each g in S3 , (fs)q is (f:), or (f2), for some p in S, or S2, 
respectively. t 
(vit) hz is a total recursive function such that if h3(¢,) * hs(t2) then 
either h,(t,) ¥ hy(z2) or ho(t4,) ¥ he(t2). 


Intuitively, assumption (7) causes each of the regular sets to be 
accepted by some automaton in the class. Note that the function h 
is such that no information can be obtained from the balloon. 

Assumption II insures that balloon control functions can be used 
interchangeably. The function associated with some state may be 
associated with none, one, or many states of a new automaton. 

The information obtainable from h; is no more than the information 
obtainable from the combination of h, and h.. 

If C is a class of automata, then the set of languages which can be 
recognized by some automaton in C is called a closed class of languages, 
or simply a class of languages. 

It should be clear that to every class, C, there corresponds a set 
of allowable balloon information functions, Hg . That is, a function, h, 
is in H¢ if and only if it is the balloon information function for some 
automaton, A, in C. Likewise, there is a set of functions, F¢, which 
is the set of allowable balloon control functions restricted to a single 
state. That is, f is in F¢ if and only if for some automaton A, in C, 
with balloon control function f,, f(t) = f,(g, 7) for some fixed state 
q of A. 

Note that a” is in Hg for all7 = 1, anda” fori = Oisin Fe, 


or any class C. We can use the following obvious result: 

Lemma 1: LethbeinH candf, ,fo,-++,fbeinF'¢. LetS= {qi ,G2,°°*, Qs}, 
I be an arbitrary set of inputs including ¢ and $, M the range of h, g 
an arbitrary map from S X I X M to (S X {-—1, 0, +1}) U {¢}, and 
FCS. Then (S, I, M, f, 9, h, a, F) is in C for any q. in S, and f defined 
by f(qi, 1) = fi) for all ¢. 

Proof: Let By = (So, Io, M, dc, go, h; po, Fo) be an automaton in C 


+ Recall (fs) is by definition the function such that (fs)q(7) = fs(q,t) for all 2. 
Likewise (f:)» and (fe)». 


UNIFIED THEORY OF AUTOMATA 1799 


with balloon information function h, and A; = (S;,JI;,M;,d;,9;,h:, 
p; , F;) be automata in C such that for each 7, 1 S 7 S s, there is a state, 
r;, in S;, such that d;(r; , 7) = f<(9) for all 7. 

For 1 S$ 7 S s, define B; from B;_, and A; according to rule II. 
Let B; = (S,I, M, e; ,9,h, g , F), where (e;),, = f; if7 S 7, and (e;),; = 
f, if 7 > 7. Surely, e, = f, so B, is our desired balloon automaton. 


Lemma 2: Let A = (S, I, AT, f, 9, h, Go, F) be an automaton in Class C. 
Let Ay = (81,11, M, fi, 91,4, G1, P:) be such that for every pin 8, , Gi) 
is either a for some i = 0 or f, for some gin S. Then A, is in class C. 


Proof: All f, , for gin S are in F,, and his in H,. Also, a” is in F¢ 
for allz = O by rule (J). A, is in class C by Lemma 1. 

We should comment that it is quite natural to force a‘ to be in 
F. for any class, C. Intuitively, the consequence is that an automaton 
may do computation in its finite control without affecting the infinite 
portion of storage. We also force a‘”, for 7 = 1 to be in Fc. These 
mappings enable us to reset the infinite memory to any given state. 
Their use will be apparent, but their justification is not so clear. We 
only observe that for any of the seven types of automata mentioned, 
suitable modifications, which do not change the power of the devices, 
can be made, so that a device can reset itself to a given state. 

For example, a Turing machine can surely erase its tape and print 
any given tape string thereon. Of course, it takes more than one 
move to do so, but this fact should not concern us. Even a nonerasing 
stack automaton can print a dummy “end of stack” marker at the 
top of stack to simulate an erasure of the stack. 


Example: Let us indicate how to interpret a two way deterministic 
pushdown automaton as a class of balloon automata. We will not 
give a formal definition here. Most readers should be familiar with 
the concept of an automaton with pushdown storage, usually taken 
to be nondeterministic, with a one-way input. The two-way, de- 
terministic variety is defined formally in Ref. 4. 

Informally, the infinite storage is a pushdown tape, of which the 
automaton can at any time read only the top symbol. The pushdown 
tape can be altered by erasing the top symbol, or by adding a symbol 
to the top of the list.f The pushdown automaton has a finite control, 
input tape and input head, similar to these portions of a balloon 
automaton. 


+ The model of Ref. 4 allows one to add any finite number of symbols, but this 
mode is equivalent to adding one at a time. 


1800 THE BELL SYSTEM TECHNICAL JOURNAL, OCTOBER 1967 


We shall not formally prove that there is a closed class of balloon 
automata accepting exactly the sets accepted by two-way, determin- 
istic pushdown automata. We shall merely give the sets He and Fr 
of balloon information and balloon control functions, and indicate 
how they reflect the pushdown structure of storage. We shall also 
indicate how any balloon automaton in the class can be simulated by 
a two-way, deterministic pushdown automaton. 

To begin, we shall assign the usual Gédel numbering to pushdown 
tapes. That is, let the allowable pushdown symbols be Z,, Z2, +--+, Z,, - 
Represent the pushdown list Z;,Z;, «+: Z;, by 2°°3°°5" +++ [x(k)]". 
Here z(z) stands for the 7th prime. (r(1) = 2, 7(2) = 3, (8) = 5, etc.). 
Define u(z), for 7 ¥ 1, to be the number of the largest prime dividing 7. 
and define «(z) to be the number of times z(u(7)) divides 7. Let (1) = 0: 
x(1) also is 0. For example, u(75) = 3, because the third prime, 5, 
is the largest prime dividing 75. «(75) = 2, since 5 divides 75 twice. 

Define F to be a set of recursive functions given by: 


(7) a, for alli = Oisin F. 

(i) For any integer, d, the function f, defined by f(¢) = i[r(u(z) + 1)]" 
« = 1, is in F. Note that f(z) finds the prime above the largest prime 
dividing 7, and multiplies 7 by that prime, raised to the power d. 

(ci7) The function f, given by f(1) is undefined, f(7) = i/[r(u(2))]""”, 
2 > 1, isin F. This function divides 7 by the largest prime dividing 27, 
as many times as it divides 7. 


The set H¢ includes a” for 7 = 1. H¢ also includes any total recursive 
function h if there is an integer d such that h(z) ¥ h(j) only if x(2) ¥ x(q), 
and at least one of «(z) and x(j) is equal to or less than d. 

Let a given pushdown automaton, P, have m pushdown symbols, 
Z,,422,°**,Z,. We will find a balloon automaton, A, whose balloon 
information function is in H, and whose balloon control function for 
any given state is found in F. The balloon information function, h, 
will have h(i) ¥ h(j) if «(¢) ¥ «(j) for x(z) and «(j) each Sm. According 
to the Gédel numbering of pushdown tapes we mentioned, h(z) will 
always indicate the top pushdown symbol of the tape numbered 2, - 
provided tape 7 involves symbols Z, , Z2, ++: , Z, only. 

Based on the top pushdown symbol, the state of P’s finite control 
(which is carried in the finite control of A), and the symbol scanned 
by A’s input head, A can move its input head, and change state accord- 
ing to what P would do. A may then have to adjust its balloon state 
to simulate a change in P’s pushdown store. If P does nothing to the 
pushdown store, the function a“ serves. If P erases the top symbol, 
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the function f, in F by rule (277) must be used. If P prints Z; on top 
of the pushdown list, the function f, in F by rule (zz), with d = 7 suffices. 

There is a subset, C, of balloon automata defined by placing an 
automaton in C exactly if its balloon information function is in H 
and its balloon control function, restricted to any particular state, 
isin Ff. We claim that C is a closed class. Surely every balloon automaton 
defined by rule I of the definition is in C. 

In rule II, we have two automata, A, and A,, in C, and must show 
that a third automaton, A; , constructed from A, and A, is also in C. 
Certainly, the balloon control functions of A; are in F. Let h, and 
h, be the balloon information functions of A, and A., respectively. 
Assume h, and h, are in H. Let h; be the information function of A; . 
Suppose h3(z) #* A3(j). Then either h,(4) ¥ h,(j) or hot) ¥ h.(j), by 
rule IJ. In either case, x(z) # x(j). Also, since h, and h, are in H, we 
can find an integer, d, such that one of «(z) and x(j) is S d. Thus, h3isin H. 

Now we must show that any balloon automaton in C can be simulated 
by a two-way pushdown automaton. The details of simulating the 
finite control and input head of the balloon automaton can be left 
to the reader. We shall only discuss how the balloon can be simulated. 

Let A = (S, J, MW, f, g, h, W, F) be in class C. Some f, , for g in S, 
may multiply the ballon state, 7, by a prime raised to some power, d. 
Note that this prime cannot divide 7. Let d, be the maximum such d. 
Some f, may be a” for j = 1. Now, let d be the maximum number 
of times a prime divides j, and let d, be the maximum such d. Finally, 
let ds be max (d,, d,). 

The pushdown automaton, P, simulating A, will have d; -++ 2 push- 


down symbols, X, Z), 21, -:: , Za,. X will mark the bottom of the 
pushdown list. Por some k, each integer, z, can be expressed in prime 
factors as [r(1)]**[a(2)]"? --- [(k)]"*, where each z7;, 1 S j S k, lies 


between 0 and d; , but 2, # 0. Then 7 will be represented by pushdown 
tape XZ;,Z;, --: Z;,. It should be clear that if h(z) # hy), then the 
tapes representing 7 and j have different top (rightmost) symbols. 

Suppose A uses a balloon control function that is in F according to 
rule (277). Then P erases the top pushdown symbol. P must also erase 
from the top, any occurrences of Z,. Suppose A uses a balloon control 
function that is in F by rule (zz), with some particular value of d. 
Surely 1 S d S d;. P must print Z, on the top of its pushdown list. 
Finally, if A uses balloon control function a’, 7 = 1, P erases its 
tape down to X, then prints Z;,Z;, -+: Z;, on its stack, where 7 = 
(r(1)]**[r(2)]** +++ [r(k)]**. Note that by definition of d3, we must 
have 2; S d; for all j. 
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From the way F is defined, it is easy to show that for any automaton 
with balloon control functions chosen from F’, there is some d; , chosen 
as above, such that if the balloon can enter a state 7, then no prime 
divides 2 more than d; times. Thus P, as above, with d; + 2 pushdown 
symbols, can simulate the balloon of A. 


III. SOME THEOREMS ABOUT TWO-WAY DETERMINISTIC BALLOON AUTOMATA 


We have spent time defining closed classes of automata. Our goal 
is not so much to talk about the classes themselves, but rather about 
the properties of the closed classes of languages that they define. Let 
us begin with a not unexpected result. 


Theorem 1: Let A = (S, I, M, f, 9, h, qo, F) be a balloon automaton. 
Then L = T(A) ts a recursively enumerable set. 


Proof: We shall describe, informally, a Turing machine recognizing L. 
First, we have assumed f to be partial recursive and h total recursive. 
Hence, there is a Turing machine, 7, which, given a block of 7 1’s 
on its single tape will halt with h(z) 1’s on its tape. Likewise, let S = 
{Q@1, 2, *** » Ge}. Then there are Turing machines T,, 7., °°: , 7; 
such that given 7 1’s on its tape, 7’; will eventually halt with f,,(¢) 1’s 
on its tape if f,,(z) is defined, and not halt otherwise, foreachj,1 <j Ss. 

We will now construct a Turing machine, 7’, recognizing L, by simulat- 
ing A. 7’ is shown in Fig. 2. It has a read only input tape with end- 
markers, and two storage tapes. The first is used to store the state 
of the balloon of A. 

The second is used for the computation of h and f. The finite control 
of 7’ will store the state of A’s finite control. 

Initially, the input head of 7 is at the left endmarker. Its finite 
control records that A’s finite control is in state q,. Storage tape 1 







FINITE 
CONTROL 


¢ STORAGE 








¢ TAPES 





Fig. 2— Turing machine 7. 
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has a single 1 on it, corresponding to the initial state of A’s balloon, 
and tape 2 is blank. 

Suppose 7 has simulated some number of A’s moves with given 
input. That is, 7’s input head is at the same position as A’s would 
be after that number of moves. The finite control of 7 holds the state 
of A’s finite control, and tape 1 holds the state of A’s balloon. We will 
show how T simulates the next move of A, if A has a next move. 


(2) Copy tape 1 onto tape 2. 

(22) Simulate 7, on tape 2. When 7 halts, suppose there are m 1’s 
on tape 2 at that time. 

(722) Suppose 7 has recorded that q is the state of A’s finite control. 
The symbol scanned by 7’s input head is a. Then 7 moves according 
to g(q, a, m). If g(q, a, m) = ¢, T never completes simulation of the 
move of A. If g(q, a, m) = (p, d), T records p as the state of A’s finite 
control replacing g. 7’ moves its input head in the direction indicated 
by d. If to do so would cause the input head to leave the input, 7’ makes 
no move, but halts without accepting. 

(zv) If T has simulated the first two stages of A’s move, it again 
copies tape 1 onto tape 2. Let p be q; for some j, 1 S$ 7 S s. ThenT 
simulates 7’; on tape 2. If f,, is defined for the number of 1’s on tape 2, 
T; will eventually print on tape 2 a number of 1’s equal to the new 
state. If not, T will not halt, hence no move of A is simulated. 

(v) Finally, 7 copies tape 2 onto tape 1 and prepares to simulate 
another move of A. However, if the three phases of the move of A 
have each been successfully simulated, and p is in F, then T' simulates 
no further moves of A, but rather, halts and accepts. 

It is straightforward to see that 7’ will simulate all moves of 4A, 
and will accept exactly when A reaches an accepting configuration. 


We shall now consider three properties of closed classes of languages. 
These properties are that closed classes of languages are closed under 
reversal, intersection and inverse g.s.m. mappings. The third property 
is perhaps the only one in the paper that is difficult to prove. 


Theorem 2: Let C be a class of automata. Let L = T(A) for some A = 
(S, I, M,f,9,h,%, FP) in C. For any w = faa, «++ a,$ in I*, define 
w = ¢4,0,-, +++ a, $. Define L’ = {w]|w”" ts in L}. Then there is an 
automaton, A,, in C such that L’ = T(A,). 


Proof: Let S = {q., @, °++ Qs}. Define S; = {@1, @2, °** Geax}, and 
A, = (81,7, M, f1,91,, G11, F). We define f, and g, as follows: 


1804 THE BELL SYSTEM TECHNICAL JOURNAL, OCTOBER 1967 


(2) (fi)o = fa for gin S. 

(72) Gian = a, 

(a7) Suppose g(q, a, m) = (p, d) for some g in S, m in M, and a 
in I — {¢, $}. Then g,(q, a, m) = (p, d) where d = +1, 0 or —1 as 
d = —1, 0or +1, respectively. 

(iv) Suppose g(q, ¢, m) = (p, d), for g in S and m in M. Then 
n(Q, §, m) = (p, d). If CE $, m) = (p, d), then CE ¢, m) - (p, d). 

(V1) Gi(dex1, @ M) = (Yer, +1) for min M andain J — {$}. 

(vt) 91 (Ge+r 5 §, m) 7 (D, d) for m in M, where gu, ¢, m) = (p, d). 

(vit) g, is @ if not defined by (772)—(v2). 

A, is in class C by Lemma 2. We must show that T(A,) = L’. Let 
the input to A, be w, of length n. By rules (27) and (v) it is seen that 
(Gear, W, 0, 1) |Z, (Qs21, w, m + 1, 1). From that configuration, A, 
never returns to state g,.:, but simulates A with the direction of input 
head reversed. 

That is, by rules (¢) and (v2), (q.11, w, m + 1, 1) |z, (py, w, J, 2) if 
and only if (q,, w’, 0, 1) |z (py, w’, nm + 1 — j, 2). Also, by rules (2), 
(zat) and (2v), for any q and p in S, integers 7, , 42, 71, Jo, With 7, and 
jo between 0 and n + 1, (q w, fi, 71) |Z, (p, W, jo, t2) if and only if 
(qq, w,n+ti1— jp, 2) |Z @, w’,)n + 1-92, %). Thus, by induction 
on the number of moves made by A, starting with one move, 


* . . 
(Go+1 , VW, 0, 1) [as (p, WwW, J, 1) 
if and only if (q,, w’, 0, 1) hi (p, w’, n + 1 — j, t). We conclude that 
A, accepts its input, w, if and only if A accepts w’. That is, T7(A,) = L’. 
Note that A could not accept without making a move, since q, is not 
an accepting state. 


Notation: Let h, and hz be balloon information functions, with ranges 
M, and M,, respectively. Let 17, have maximum element k. Define 
h,-h. to be the function [h,-h.](@) = hi@) + (k + 1)h.(). Define 
M,-M, to be the range of h,-h.. We will also need the functions which 
are partial inverses of the - operator. So, we define o,(k, 7) = 7 modulo 
k + 1, and o.(k, 7) = [j/(k + 1)].t If k is as above, andj = [h,-hs](2), 
then o,(k, 7) = hy(2) and o2(k, 7) = ho(2). 

Note that according to the definition of closed class, if h, and hz are 
in Hg, , then h,-h. is in Hg for any closed class, C. 


Theorem on If A, —= CS; 3 I, ) M, ; fi ) 91 ’ hy } q1 ) Fy) and A» —_ (S» } Oe 
My, fo,92,h2,Q2, 2) are automata tn class C, then there is an automaton 
A; in C accepting L = T(A,) (\ T(A2). 


¥ [x] is the integer part of x. 
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Proof: By a simple application of Lemma 2, we can find automata 
accepting 7'(A,) and T(A.), each of whose set of input symbols is 
I,U I, .8o, we will assume that J, = J. = J. Likewise, from Lemma 2, 
we can assume S, and &, are disjoint. We construct a third automaton, 
A; = (S;, 1, Mz, fs, gs, Rs, 91, Fo). Here, Ss = S; US: VU fas}, 
where q3 is not in S, or S,. Also, 47; = A7,-A7, and hz = hy:h.. We 
define fs and g3 as follows: 


(2) If g is in S,, then (fs). = (f1).- If q is in S2, then (fs). = (fo)a- 

(it) (fa)e. = ao”. 

(72) Let k be the largest element in M7, , and let m be in M;, with 
m, = o,(k, m) and mz, = oo(k, m). Let a be in J. Suppose g is in S; 
but not in F,, and gi(q, a, m,) = (p, d). Then g3(q, a, m) = (p, d). 
If gisin F, , gs(q, a, m) = (qs, 0). 

Suppose g is in S; , instead, and g.(q, a, m2) = (p, d). Then g3(q, a, m) = 
(p, d). | 

(iv) gs(qs , 4, m) = (q3, —1), for allain J — {¢} and min M;. 

(v) gs(ds, €,m) = (p, d) if go(q2, ¢, m2) = (p, d), where m, is as in (772). 


From rules (2) and (iz) it is clear that until A, enters an accepting 
state, A; enters a configuration (gq, w, j, 7), gin S,, if and only if A, 
would enter that configuration. If (q,, w, 0, 1) |a, (p, w, J, 2), where 
p is in F,, and no accepting state has been previously entered, then 
by rules (z) and (72), (q., w, 0, 1) a, (, wv, 3, 2) Ia, Gs, w, Jj, 1). If w 
is not accepted by A,, then A, will never enter state q3 . 

By rules (iz) and (2v) (qs, w, 7, 1) la, (¢s, w, 0, 1). By rules (2) and 
(v), (gs, w, 0, 1) |a, @, w, J, 2) if and only if (q2, w, 0, 1) la, @ ¥ J, 2)- 
From this point, A; simulates A, in a straightforward manner, entering 
an accepting state with w as input if and only if A. does. Thus, in order 
for A; to accept w, both A, and A, must accept it, and whenever these 
accept w, A; will likewise accept w. In other words, 7'(A3) = T(A,) O 
T(A.). 

By part II of the definition, and Lemma 2, A; is in class C. 

We are now going to prove a theorem on inverse g.s.m. mappings. 
A generalized sequential machine (g.s.m.) is a finite state transducer.” 
It is usually defined as a 6-tuple, G = (K, 2, A, 6, d, po). K, Z and A 
are the finite sets of states, input symbols and output symbols, respec- 
tively. 6 is a mapping from K X > to K, and X is a mapping from 
K X = to A*. Lastly, po is in K and is called the start state. We extend 
6 and \ to domain K X * as follows: 6(q, 6) = gq and Aq, 6) = « 
for all g in K. For w in &* and a in &, 6(q, wa) = 6(6(q, w), a) and 
A(q, wa) = A(q, w)A(5(G, w), a). Define Gw) = (po, wv). 
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We can define a function y for the g.s.m. G, as above. y maps K X =* 
to the subsets of K. If gis in K and w is in &%*, then 


x(q, vw) = {p | 5p, w) = qh. 
For win >* and ain 2, given y(q, w), we can find y(q, aw) by: y(q, aw) = 
{p | for some p, in y(q, w), 6(p, a) = pi}. 

We intend to prove that if A is a balloon automaton of class C, 
and G is a g.s.m., then there is an automaton, A,, in C, such that 
T(A,) = {¢w$] if Gw) = w,, then ¢w,$ is in T(A)}. We need an 
auxiliary definition and a lemma. 

A two-way finite automaton” is a device with a two way, read only 
input tape and a finite control. Formally, the device is denoted A = 
(K, 2, 6, po, Ff). K and > are finite sets of states and input symbols, 
respectively. = always includes ¢ and $, the left and right endmarkers 
of the input, respectively. F C K is the set of final states, and po , 
in K, is the start state. 6 maps K X = to K X {—1, +1}. Intuitively, 
if 5(q, a) = (p, d), then A, scanning a on its input, in state q, goes 
to state p, and moves its input head left or right, depending on whether 
d = —lor +1. 

We denote a configuration of A, with input w, by (q, w, 7). We assume 
w can be written as ¢w,$, where w, is in (2 — {¢, $})*. Let w, consist 
of n symbols. The position of the input head is indicated by 7. That is, 

= 0 if the input head is scanning ¢, if 7 = n + 1, the head scans $, 
and if 1 S$ 7 S n, the head scans the 7th symbol of w,, counting 
from the left. Thus, ¢ is the zeroth symbol of w, and $ the n + Ist. 
Of course, q is the current state of A. 

Say that (q,, W, 4) [7 (qe, ¥, t%2) if ais the 7,th symbol of w, 6(q, , a) = 
(q2, d) and 7 = 7, + d. However, we must have 0 S$ 7% S$ n+ 1. 
We define the relation |; by (4, uw, 2) la (g, w, 2), for any configuration, 
(q, w, t), of A, and (q,, w, 41) [= (Gm, W, tm) if there are configurations 
(Qo, W, 22), Ga 52RD; ta) ey NO aes WD, i) such that for] Sj < m, 
(q;, w, t;) |Z Gas , wv, i). Although we are not concerned with 
acceptance by two-way finite automata, (they accept the regular sets, 
as is well known) we will define the tapes accepted by A, denoted 
T(A), to be {w| win ¢(= — {¢, $})*$, (po, w, 0) hi (p, w, ‘ ‘for some 
p in F and integer, 7}. 


Lemma 8: Let G = (K, 3, A, 6, , po) be ag.s.m., with ¢ and $ not in 2. 
Then, we can construct a two-way finite automaton, 


A i (Ki, =U {¢, $},6:, G0, F) 


with the following properties: 
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(t) K, ts expressed as K, X K. Elements of K, are denoted [q, p] where 
qgisin K,, pin K. 

(it) gq, and q2 are particular elements of Kz. 

(iit) Let w = a,a. --- a, be in Z*, eacha, in 3,1 Sk S n. Suppose 
5(Po , Aid, --* A;-1) = p,t = 2. Then 


(lq. , Dl, ¢w$, 2) fa (ge, al, ¢w$, ¢ — 1), 


where 5(po, Q,2 *-* G2) = g. Never is q, or q2 the first component 
of state of A, except for the first and last configurations. 

(tv) qo and F are irrelevant, since the lemma concerns, not the recognizing 
power, but the structure of two-way finite automata. 


Proof: This lemma was essentially proven in Ref. 11, with direction 
of input head reversed. We shall, therefore, not give a formal proof, 
but just sketch the argument. The result in Ref. 11 did not involve 
the function 6 of a g.s.m., but another function which had the properties 
needed, properties which 6 has. These properties are: 


(z) 6(g, w) is unique for q in K, w in >*. 

(it) If y is defined as in the definition of the g.s.m., and p, and py 
are in K, p, ¥ pe, then for any w in >*, w ¥ «, y(p,, w) and y(p2, w) 
are disjoint. (For if p were in both, then 6(p, w) = p, = po , violating (7).) 

(222) If ps isiny(p, , w) and p,iny(p., w), and w = ww, with w2 ¥ «, 
then d(ps, wi) # 6(ps, w,). (For if not, let 6(ps , w:) = 6(ps, wi) = P. 
Then y(p, , We) and y(pz , w2) each contain p, and w, ¥ e, violating (72).) 

(iv) If p, and pz are in y(p, w), then 6(p,, w) = d(p., w) = p, by 
definition of y. 

We will now sketch the design of A. Let ¢w$ be its input, w = 


@,d2 °°: a,, as in the statement of the lemma. Suppose the input head 
of A is scanning a; , and A is in state [q, , p]. Presumably, 


5(Po, G42 *** As) = Pp. 


A moves its input head left, and computes y(p, a;-1). If y(p, a:-1) 
contains a single element, p,, then p, must be 6(p9, a@,@2 -*+ G,-2). 
A can easily enter configuration ([q. , p,:], w, 7 — 1). 

It is not possible that y(p, a;_1) is empty. Suppose y(p, a;-1) con- 
tains r elements, r > 1. Let these be p,, po, --:, p,. A moves left. 
For 7 = 1 — 2,7 — 3,7 — 4, +--+ it successively computes 


(Dx @;Qj41 °° * Az) 


from y(p,, @j410j22 °** G;-2) for 1 S$ k S r. Unless the process ter- 
minates, in one of two ways we will describe, A then drops y(pz , @j414j+2 
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- @;-2) from memory. Given G, we can find an upper bound on 7, 
so the amount of information stored in A’s finite control is bounded. 

(z) Suppose that for some largest j, for only one value of k, say k = m, 
is ¥(Mz » @j@j41 °° * @;-2) nonempty. Then surely p,, is 6(Mo , @i@2 °° * G;-2). 
A must find its way back to position ¢ — 1. Presumably, one can find 
k, and k, such that y(pz, , @i+:@i42 *** @i-2) and y(Dz, , Gjsiz+2 *** Ai-2) 
are not empty. Choose s, and s, from these sets, respectively. A then 


moves right, computing 6(s, , Qj11@j;42 °°* G1) and (82, Gj41@j22 °** G2) 
for! = 7+1,7 + 2, --- . By comments (777) and (dv) above, we will 
not have 6(8, , Qj11@j12 °° G1) = 6(S2, AjyarQja2 °° G,) until] = 7 — 1. 


A is thus positioned properly, and can enter configuration 
([g2, Pm], w, 2 — 1). 


(iz) Suppose that no 7 satisfies condition (7). Then A will eventually 
reach the left endmarker. It must be that for some m, po is in Y(Pm , 
G02 *** QA;-2). Thus, p,, iS 6(po, Gid2 °** @;_-2). A must find its way 


back to position 7 — 1. So, A chooses s, and s, in y(pz, , @:@2 °** As-2) 
and y(px, » Aid *** G;-2) for some k, ¥ k, . A moves right, successively 
computing 6(s,, @ya, --+ a) and 6(s2, aa, --* a) forl = 1,2, -+:. 


When 6(s,, @,:@2 °** @:) = 6(S2, @,a2 -*+ G1), we must havel = 7 — 1. 
A easily enters configuration [q2 , p,,], w, 7 — 1). 


Theorem 4: Let A, = (8,,1,,M, fi, 91,h, 171, F1) be a balloon automaton 
in class C. Let G = (K, &, A, 6, \, po) be ag.s.m., where A = I, — {¢, $}. 
Then there ts an automaton, Az in class C, such that 


T(A2) = {¢w$ | EG'(w)$ is in T(A,).}. 
T(A.) ts commonly called an inverse g.s.m. mapping of T(A,). 


Proof: Let A = (K,, =U {¢, $}, 61, @, F) be the two-way finite 
automaton constructed from G in Lemma 3. Let A, = (S2, J2, IM, 
fo, 92,h, 12, F2), where I, = ZU {¢, $}. Let S. = {[g, p, 7, u, 1, k] | ¢ 
in K,, pin K, rin 8,, wa string in (J, — {¢, $})*, of length at most 
max (| \(s, a) | for s in K, a in X), J an integer between 0 and | x |, 
and k an integer between 1 and 8}.f K, is defined as in Lemma 
3, as are its particular elements, gq, and go. 72 = [@2, Po, 11, € 9, 1). 
F, is the set of all states in S, whose last component is 8. 

We shall call the last component of states in S, the pointer. It in- 
dicates, among other things, if A, is simulating A, A, or G. The first 
component is part of a state of A. It is needed because A, may move its 


t |x| denotes the length of string z. 
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head left to simulate A,. In that case, the routine A is needed to 
determine the state of G at the new position of A,’s input head. The 
second component of A.’s state indicates what state G would be in if 
it had processed whatever is to the left of A.’s input head. The third 
component is the state of A,. The fourth component is the output 
when the input to G is the symbol currently scanned by A,.’s input 
head. The fifth component indicates where, among the symbols of the 
fourth component, A,’s input head would be. In Fig. 3, the construction 
of A. is symbolically indicated. 
We define f, by: 


(2) (fe) tap.rutel = a fork = 3, 5, 6, 7, 8. 

(it) (fe) te.p.riu.t.e) = (f1), for k = 1, 2, 4. For min M, p in K, qin 
K,, rin S, but not in F, and a in J, — {¢, $}, we define g. by: 

(221) g2([q2 » Pos"; €; 0, 1], ¢, m) = ([q2 1 Po, 8; €, 0, 1], 0) if gi(r, é, m) oo 
(s, 0). (A, simulates A, , scanning and remaining at ¢ on its input.) 

(tv) go([g2 , Po, 7, €, 9, 1], ¢, m) = ([q2, Po, 8, €, 0, 2], +1) if ni, ¢, m) = 
(s, +1). (A2 simulates A, moving right from ¢. The pointer is set to 2, 
so A, will next compute the output of G for the symbol it will next 
scan on its input.) 

(v) gala, D, 7, & 9, 1], $,m) = ([q2, p, 8, ¢, 0, 1), 0) if gir, $, m) = 
(s, 0). (A, simulates A, scanning and remaining at $.) 

(vt) go([g2, P, 7; €, 9, 1], $,m) = ([a., 7, 8, €, 0, 4], 0) if gilr, $,m) = 





Fig. 3— Automaton Ae. 
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(s, —1). (Az simulates A, moving left from $, and prepares to simulate 
A. The pointer is set to 4, and the first component to qj.) 

(vit) g2(l9, PT, €; 0, 4], a, m) ra go(Iq, P,T, € 0, 5], a, m) . ([q’, PD’; r, 
0, 5]; d) if 6,([g, PI, a) ron (Iq’, Pp’), d), for q — G2 » and a in I,. 
(A, simulates A in A,’s first two components of state. The pointer 
is held at 5.) 

(v227) g2([ae »D,T, € 0, 5], a, m) ie ([¢2 »D, 7, U, l, 6], 0) if u = A(p, a) 
and u + e. Here, | u| = 1. (A, computes the output of G and prepares 
to simulate A,. The pointer is set to 6.) 

(tx) If instead, A(p, a) = & 92([¢2 > DP, 7, & 0, 5], a, m) ae ([q: ; D, 7; 
e, 0, 5], 0). (A. must simulate A again to find an input symbol that 
gives an output = e.) 

(x) gla Po, 7, & 0, 4], ¢, m) = g2([q » Po, T; & 0, 5], ¢, m) and is 
equal to go([g2, Do, 7, & 0, 1], ¢, m) as defined by rules (277) and (2). 
(A, was prepared to begin simulating A, but found itself at the left 
endmarker. Note that in this case, the state of G must be py. Az im- 
mediately simulates A, .) 

(x1) g2([de2 » DT, & 0, 2], a, m) — g2([ae DP, 7, &; 0, 7], a, m) 7 ([q2 , 
p, r, u, 1, 6], 0) if \(p, a) = wand u + «. (Az has simulated a move 
right of A,’s input head. It computes the output of G and prepares 
to simulate A, . The pointer is set to 6, as in rule (v777).) 

(xit) If instead, A(p, a) aS g2([q2 »P, 7, & 0, 2], a, m) ae g2(Ide , DP, 
r, ¢, 0, 7], a, m) = ([qo, t, r, ¢, 0, 7], +1) if d(p, a) = t. (A. must search 
right, in order to find an input symbol that does not give e« output 
when given to G.) . 

(x7272) g2([@2 1D, 7, & 0, 2], $, m) me g2([q2 » DP, T, & 0, 7]; $, m) and is 
equal to g2([q2, p, 7, ¢ 9, 1], $, m) as defined by rules (v) and (v2). (Az 
was simulating a move by A,, but encountered the right endmarker. 
A, immediately simulates another move of A, .) 

(xiv) Suppose u ~ e and 1 $7 S | u |. Also, suppose g,(r, b, m) = 
(s, d), where b is the Ith symbol of u, and 1 S1+ dS |u|]. Then, 
g2([de » P, T, U, l, 1], a, m) = g2([ge , DP, T, U, l, 6], a, m) = (qe P,; §, 
u, | + d, 1], 0). (Az simulatesa move of A,, where A, is assumed 
scanning the /th symbol of wu.) 

(xv) Under the assumptions of (viv), if 1 + d = 0, go([g2, p, 7, U, 
I, 1], a, m) = g2(lqo, p, 7, U; l, 6], a, m) = ([m, D, 8, €, 0, 4], 0). (Ae 
simulates A, , but finds that A, moves left from wu. A, prepares to sim- 
ulate A.) 

(xvi) Under the assumptions of (xiv), if] + d > |u|, go([qe, p, 7, 


} Recall 5, is the next state mapping of A. 
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U, l, 1], a, m) = g2([qe , P; tr, U, L; 6], a, m) = ([ge2 , t, 8, €, 0, 2], +1), 
where ¢ = 6(p, a). (Az simulates A,, but finds that A, moves right 
from u. Az simulates the state transition of G.) 

For rin F, and any k: 


(xvit) go([q, p, 7, u, 1, kj, a, m) = ([g, p, 7, u, l, 8], 0). (A, has been 
simulated entering an accepting state. A, sets the pointer to 8 and 
accepts.) 


By rule (avi) above, we see that exactly when A, gets to a state 
with third component in F, will it accept. It is sufficient to show that 
A, can simulate any single move of A, which does not start from an 
accepting state. 

Formally, let us focus our attention on a particular input, ¢w$, 
to A,, where w is in (J, — {¢, $})*. Let G(w) = v and | v| be n. For 
this particular w, and configuration (r, ¢v$, 7, 2) of A,;, we define the 
inverse image of (r, ¢v$, 7, 2), denoted II(r, ¢v$, 7, 2) as follows: 


(z) If j = 0, then (Ig, Po, T, € O, kj, éw§, O, 2) is in II(r, ¢v$, 0, 2) 
if eitherk = landg = qork = 4andq=q,,ork = 5andq=q. 

(i) If 7 = n + 1, then ((q., p, 7, ¢ 0, kl, ¢w$, n, + I, 2) is in II(r, 
¢v$, n + 1, 7) if p = 6(po, w) andk = 1, 2 or 7. Here n, = | w |. 

(722) Ifls j s n, (qe , Pp, 7, U, l, kj, ew, hi, 2) is in II(r, ¢v$, ds 1) 
if one can write v = vu, and w = w,aw,,ain I, — {¢, $}, such that 
the following is true: 


I 


(a) (M0, Wi) = p 
(b) Apo, Wi) = VY 
(c) \(p,a) =u xe 
d@) n=lwu{[+1 
(e-) g=|[a[ +1 
(Gf) k=1or6. 


Intuitively, A,’s input head is scanning the symbol giving rise, 
when fed to G, to the symbol scanned by the input head of A,. 


We must show that if (r, ¢v$, j:, 71) [z, (s, ¢v$, jo, %), and r is not 
in F,, then if ([¢g, m1, 7, 1, 1, kil, ¢w$, js, 21) is a configuration 
in II(r,, ¢v$, 7, , 7), then there is some configuration ([q’ , po, 8, Ue ; 
ls, ke], ¢w$, js, 22) in I1(s, ¢v$, j2 , 22) such that: 


(Iq »P1,7, U1 ? l, ? k,], ¢w§, 35 ji) ire ([q’ » P2,8, U2, l, ? ke], ew, J43 12). 


Case 1: j, = jo = 0. The result follows trivially from rules (iz), (c72) 
and (2). ; 
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Case 2:9, = jo = n + 1. Trivial from rules (72), (v) and (izz). 


Case 3: 3, = 0, j2 = 1. By rules (22), (iv) and (x), ([¢, po, 7, € 0, 4, 

gw§, 0, 1) ree [ge » Do, §; & 0, 2], ew, i 2): By rules (2), (xt) and (wit), 

if v ¥ €, ([g2, Do; 8, e, 0, 2] , $u$, 1, Zo) Iz ( [a , D, 8, U, 1, 6] , CWS, j, 12), 

where if w = a,@. -+- a,,, then oe Q,Q2 °** G;_1) = DP, A(Po, A142 

a@;-1) = e and X(p, a;) = u. If v = «, by rules (z) and (zit) ([q , 

Po, §; & 0, 2], ew, 1, 12) re ([q2 » DP, 8, €, 0, k,], cw, n - 1, to), where 
= 6(po, w) and k, = 2 or 7. 


Case 4:7, = n + 1, jp = n. By rules (iz), (vz) and (xizz), ([q2, pi, 
r, €,0, k], ¢w,$ni + 1, 21) [7, (lq » pa 5 8) €, 0, 4], ¢w$, m, + 1, 2). If v ¥ e, 
by Lemma 3, and ites (t), (viz), (vitd) and (zx), ([q:, pi, 8, ¢, 0, 4], 
¢w$, 1; , t2) IF. ( [Qo Do, 8, U, l, 6], ¢w$ j, 72), where if w = @,Q2 +++ An, , 
5 (poe vet Az 1) = Pp2, Do, a Q1d2 °** A;z- -1) = Ui, vu = I; and 
Mp2, @;) = u. Ifv = « by Lemma 3 and rules (2), (viz) and (zz), 
(lq » Pi, 8; & 0, 4], ew§, nm, 72) re (lq. » Po, 8, 0, k], ew§, 0, 12), where 
k = 4or 5. 





Case 5: j, is not 0 or n + 1. Also, 1 + jo — 7; lies between 1 and | u |, 
where / and wu are defined in part (i722) of the definition of inverse image. 
The result is immediate from rules (22) and (iv). 


Case 6:j, isnot Oorn + 1, butd = |u| andj, = 7, + 1. By rules (zz) 
and (xv7), ([q2 , Pi, 7, U, l, kl, ew, Je ? 7) las ([g2 , P2, 8, & 0, 2], ew$, 
js + 1, t2), where ([q2, pi, 7, u, U, k], ¢w$, Js, 7,) is either of the inverse 
images of (r, ¢v$, j: , 21). The rest of the argument for this case is similar 
to that of case 3, and will be left to the reader. 


Case 7: 7, is not 0 or n + 1, but 2 = 1 and j, = j, — 1. By rules (22) 
and (xv), ([qe2 ) D; r, Uy, l, k], ¢w§, Js ? 11) lag ([qe ) P> S, €, 0, 4], ¢w§, Js 5) 12); 
where the former configuration is again either of the inverse images 
of (r, ¢v$, 7, , 21). The argument proceeds as in case 4. 

We claim, from the above, that ([qg2, Po, 71, € 0, 1], ¢w$, 0, 1) pa 
(lg, ps rs UL, Hi], EwS, fh, a) Ia, (Ig Dm wb 8), dw, ir , 1), for some r 
in Ff, , k ¥ 8, if and only if (r,, ¢v$, 0, 1) |z, (7, ¢v$, J, 1) by a sequence 
of moves for which A, never previously enters an accepting state. 
Here u, 1, j and j,; are related as in part (i77) of the definition of inverse 
image. Thus, T(A.) = {¢w$ | for some v with ¢v$ in T(A,), G(w) = v}. 
We must add that by Lemma 2, A, is in class C. The theorem is thus 
proven. 
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IV. OTHER TYPES OF BALLOON AUTOMATA 


We have considered the two-way deterministic balloon automaton. 
To complete the story we should consider three other models—non- 
deterministic two-way balloon automata, and one-way balloon auto- 
mata of the deterministic and non-deterministic varieties. 

A nondeterministic device typically has the choice of a finite 
number of possibilities for each move. We choose to make the finite 
control function nondeterministic. This added capability enables us to 
represent the nondeterministic versions of the seven types of automata 
which we could represent by a deterministic balloon automaton. 

A one-way balloon automaton is, quite naturally, a two-way 
balloon automaton, restricted so that the input head can only move 
right or not move at all. 

We shall not repeat the definitions for each of the three new types 
of balloon automata, but, as a model, shall make use of the definition 
of two-way deterministic balloon automata. 

A two-way, nondeterministic balloon automaton is denoted A = 
(S, I, M, f, 9, 2, ¢@0, F) where all components are defined exactly as 
for the deterministic case, except that g is a mapping from S X I X M 
to the subsets of S X {—1,0 + 1}. 

A one-way, deterministic balloon automaton is denoted as are the two- 
way types, but gis a mapping from S X I X M to (S X {0, +1}) U {¢}. 

A one-way nondeterministic balloon automaton is denoted as are the 
two-way types, but g is a mapping from S X J X M to the subsets 
of S X {0, +1}. 

The closed classes of one way nondeterministic balloon automata are 
similar to the abstract families of acceptors in Ref. 21. 

We shall use the abbreviations 2DBA, 2NBA, 1DBA, and 1INBA 
for, respectively, two-way deterministic, two-way nondeterministic, 
one-way deterministic and one-way nondeterministic balloon automata. 

A configuration of any of the four types is denoted as for the 2DBA, 
(q, w, j, 2), where, q is the state of finite control, w the input, j the 
input head position, and 7 the state of the balloon. 

The possible moves of the 2NBA are determined as one would 
expect. One uses the balloon information function. Based on the value 
of that function, the input symbol at the position of the input head, 
and the state of finite control, one chooses a pair of next state of finite 
control and direction of input head, according to g. Then, based on 
the new state, the balloon control function is used. 

Formally, if A = (S, J, M, f, 9,2, ¢0, F) isa 2NBA, and (q1, w, j1 , 21) 
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and (q2, W, je, t2) are configurations of A, with n the length of w, 
then we say (q1 , W, ji, 41) goes to (q2 , W, jo, t2) by a single move, denoted 
(qi, W, jr, ts) |Z (Ge, W, jo, t2) exactly when for some m in M, a in 
I,d = —1, 0 or +1, we have h(z,) = m, the j,th position of w is a, 
g(qi , a, m) contains (q., d) and f,,(7,) = i,. Also, 7, + d is between 0 
and n + land j. = 7, + d. If (q1, w, 7:, 7:) can go to configuration 
(dz, W, jo, %2) aby some number of moves, including 0, then we say 
(M1, W, try %1) Fi (2, W, Jo, t2). 

The notion of move, and the relations |— and I~ are defined for the 
1DBA and INBA exactly as for the 2DBA and 2NBA, respectively. 

A 2NBA accepts an input, w if for some choice of moves it enters 
an accepting state. Formally, define T(A), for a 2NBA, A = (S, J, 
M, f, 9, h, Go, F) to be {w | (qo, w, 0, 1) [a @, w, J, 2) for some g in F}. 

For the one-way types, we require that the input head reach the right 
endmarker when it accepts. That is, if A = (S, J, M, Lf g,h, 90, F)isa 
INBA or 1DBA, then T(A) = {w] (qo, w, 0, 1) [5 (q, w, n + 1, 2) 
for some g in F, where n is the length of w}. 

The notions of closed class of balloon automata for the 2NBA, IDBA 
and INBA are defined exactly as for the 2DBA. 

Note that, for example, a 1DBA is not a INBA, although there are 
obvious relationships. Also, strictly speaking, a closed class of 1DBA 
is not a closed class of INBA. Both parts I and II of the definition for 
INBA would require nondeterministic finite control functions in any 
class of LINBA. Analogous statements hold between 2DBA and 2NBA, 
IDBA and 2DBA, 1NBA and 2NBA. 

It is trivial to see that Lemmas 1 and 2 hold for the 2NBA, IDBA 
and INBA. 

A set of languages is said to be a closed class (or simply class) for the 
2NBA, 2DBA, INBA, or IDBA if they are exactly the languages 
accepted by a closed class of automata of that type. 


V. TWO-WAY NONDETERMINISTIC BALLOON AUTOMATA 


Theorems 2, 3 and 4, proven for the 2DBA also hold for the 2NBA. 
In each case, the simulation by an automaton in some class, C, of one 
or two other automata in C' was involved. In the 2NBA case, the simula- 
tion can be nondeterministic if the simulated automata are. We will 
therefore omit the proofs of the three theorems for the nondeterministic 
case. 

Likewise, Theorem 1 holds for the 2NBA. We can simulate a 2NBA 
by a nondeterministic Turing machine just as we simulated the 2DBA 
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by a deterministic Turing machine. A nondeterministic Turing machine, 
as is well known, can be simulated by a deterministic Turing machine. 

There is one additional, simple theorem we can prove for the 2NBA 
but not the 2DBA. 


Theorem 5: If L, and L, are languages accepted by automata A, and 
A,, respectively, in class C of 2NBA, then there is an automaton, A3, 
in C accepting L,U L,. 


Proof: Let A, = (8,, 11, M1,hf1,91,%1,%, F:) and A, = (82, J2, 
Ms, fo, 92, h2,Q2, Pe). AS was mentioned, by Lemma 2 we can assume 
that S; (\ S. = g and J, = J, = J. Consider a new automaton, A; = 
(S3, I, Ms, fs, 9s, hs, 4s, F's). Ss = S,; US. U {qs}, where qs is not 
in S,;U S,. Fs; = F, U F,. Mz = M,-M, and h; = h,-h..f Define 
fs by (fs)as = @, (fs)a = (frJaif gis in S,, and (fs), = (f2),if qisin S,. 

Let the largest element of M/, be k. We define g; as follows. For a 
in J and m in M3, let m, = o,(k, m) and m, = o.(k, m). If gisin S,, 
then gs(q, @, m) = gi(q, a, m). If g is in Sz, gs(g, a, m) = g(q, a, me). 
Finally, gs(qs , a, m) = gi(qi, @, 1) U go(qe , a, me). 

It is straightforward to see that A; is in class C. 

It should be clear that for any input, w, (q3, w, 0, 1) |Z, @ w, 7, 
exactly when either q is in S, and (q,, w, 0, 1) lz, (q] w, j, 2) or ¢ is 
in S, and (q2, w, 0, 1) |a, @, w, j, 7). Also, once in a state of S,, As 
remains in a state of S, and simulates A,. Likewise, in a state of S., As 
simulates A,. Thus, by induction on the number of moves made, 
starting with one move, we have (qs, w, 0, 1) |z, (q, w, j, 2) if and only 
if (a1, w, 0, 1) [k, @ @, 3, 4) or @, w, 0, 1) [2, @ w, 3, 2). Thus, since 
F, = Ff, F,, and neither q, or g. may bein F; , it follows that T(A;) = 
T(A,) U T(A,). 





VI. ONE-WAY DETERMINISTIC BALLOON AUTOMATA 


The 1DBA is the poorest of the four types in terms of the operations 
on languages which preserve membership in a closed class of languages 
for given types. Of the operations preserving membership in class for 
the two-way devices, only inverse g.s.m. mappings preserve member- 
ship in class for the 1DBA. The proof is along the lines of that of 
Theorem 4, but is simpler because the input head never has to move 
left. We will omit the proof. 

There is one new operation which does preserve classes for the 
1DBA, and, incidently, the INBA. This operation is intersection with 


co 


f Recall the definition of the operation , o, and o2 in Section III. 
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a regular set. Classes for the 2NBA and 2DBA were closed under 
intersection of languages in the class. A simple use of part I of the 
definition of closed class shows that every regular set is in every closed 
class, so intersection with a regular set surely preserves membership 
in class for the 2NBA and 2DBA. 

We shall give the usual formal definition of a finite automaton. 
See Ref. 20, for example. A finite automaton is a 5-tuple, A = (K, 2, 
6, 9, /). K is the finite set of states, Z the finite set of input symbols. 
F is a subset of K, the final states, and qo, in K is the staré state. 6 is 
amap from K X = to K. We extend 6 to domain K X 2* by 6(q, «-) = 
for all gin K, and 6(q, wa), for gin K, win 2* and ain 2 is 6(6(q, w), a). 
Define T(A) = {w | 4(q., w) is in F}. The finite automata accept 
exactly the regular sets. 


Theorem 6: Let C be a class of one-way, deterministic balloon automata. 
Let L be accepted by some automaton, A in C, and let R be a regular set. 
Then L (\ R ts accepted by some automaton in class C. 


Proof: Let A = (S, J, M, f, 9g, h, @, F) be a IDBA. Let R, = RM 
¢(I — {¢, $})*$, and let R. = {w| w$ is in R,}. If RK is regular, then 
R, and R, are both regular. It is sufficient to show that there is an 
automaton in C accepting L (\ R,. To that end, let A, = (K, J, 4, 
po, F,) be a finite automaton with 7(A,) = R,. Define A, = (S,, J, 
M, fe, g2, h, G2, Fe) to be a IDBA, with S, = S X K, gq = [qo, Dol 
and F, = F X F,. Define f, and gz as follows, for all g and gq, in S, p 
and p,in K,ainJ andmin d/: 


(z) Suppose g(q, a, m) = (q,, 0). Then for all p in K, g2([g, p], a, m) = 
([q: 5) pl, 0). 

(27) Suppose g(q, a, m) = (qg,, +1) and d6(p, a) = p,. Then g,(lq, p], 
a,m) = ([q%, pi], +1). 

(472) (fe)ter) = fq for all pin K. 


The states of A,.’s finite control have two components. The first 
is a state of A and the second a state of A,. By rules (2) and (zzz), 
when the input head of A does not move, Az simulates a move of A, 
but does not change the state of A,. By rules (72) and (diz), when 
the input head of A moves right, Az simulates that move also, but 
adjusts the state of A, in the logical manner. 

Formally, we can show by induction on the number of moves of 
A or Az, starting with 0 moves, that ([q , Pol, w, 0, 1) |a, (Ig, pl, w, 4, 2) 
if and only if: 
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* 

(2) (Qo; Ww, 0, 1) la (4, UW, i, 1), and 

(tz) 6(po, W,) = p, where w, is that portion of w to the left of posi- 
tion j. 

Now a, word, w, of length n, is accepted by Az if and only if ([q@o , Dol, 
w, 0, 1) |Z, ([¢, p], w, m + 1, 2), for some gin F, pinF, and any integer, 7. 
The above is equivalent to saying that (qo, w, 0, 1) [a (q, w, n + 1, 2) 
and 6(po, W,) = p, where w,$ = w. That is, w is in T(A) and w, is 
in T(A,). But w, is in R, = T(A,) if and only if w is in R,. Thus, 
T(A,) = LC\R,.It should be clear, by Lemma 2, that A, is in class C. 


Corollary 1: If L ts a language in class C for the 1DBA, and R is a 
regular set, then L — Rts in class C. 


Proof: Let L be contained in J* for some finite alphabet, 7. Then L — 
R = LQ (I* — R), which is in class C by Theorem 6. 


Theorem 6 applies also to the 1NBA. In fact, there is an additional 
corollary that can be shown for the INBA. 


Corollary 2: Let L be in class C of 1NBA, and let R be a regular set not 
involving symbols ¢ or $. Then L U ¢R§$ is in class C. 


Proof: The results is a simple extension of Theorem 6, and will be 
left to the reader. 


VII. ONE-WAY NONDETERMINISTIC BALLOON AUTOMATA 


As the IDBA was the poorest of the four models, in terms of provable 
properties, the 1NBA is the richest. Theorem 4, concerning inverse 
g.s.m. mappings, certainly holds for the INBA, as do Theorem 5, 
Theorem 6 and its corollaries. 

To begin a study of the INBA, we will show that with the proper 
definition of acceptance, endmarkers on the input are not necessary. 
Let A = (S, J, M, f, 9, h, %, F) be a INBA. We informally define 
T(A) as the set of strings, w, in (I — {¢, $})* which cause A to leave 
w moving right, at the same time entering an accepting state. 

We need a slightly revised notion of a configuration. Since w has 
no endmarkers, its length is the number of symbols comprising w. 
(Recall, we never counted endmarkers in determining length.) Let w 
be of length n. Then (q, w, j, 2) is a configuration of A if q is in S, 7 
is an integer and 1 S j S n. The initial configuration for a INBA 
without endmarkers is (q., w, 1, 1). For convenience, we define a 
configuration, (*), which is imagined to result when A is in a configura- 
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tion (q, w, n, 7), and the finite control function allows A, on the next 
move, to move its input head right and enter an accepting state. T 
There is no change in the definitions of |7 and |. The former relates two 
configurations if the second is obtainable from the first by a single move, 
and the latter — if by some finite number of moves. Note that no con- 
figuration can result from (*). P 

Now, we define 7'(A) as {w]|(q., w, 1, 1) [7 @)}. When talking 
of a INBA and the Tf definition of acceptance, we will allow the start 
state to be an accepting state. If so, we shall, by convention, say that 
eis in 7(A). We will endeavor to show that a language is 7(A,) for 
some INBA, A,, if and only if it is 7(A2) for A, , a INBA in the same 
classes as A, . The result is broken into two parts. 


Theorem 7: Let Ay = (81, I, M, fi, 9:1, 2, 1, Fi) be a INBA with 

= 7(A,). Then there is another INBA, Az = (S2, I, M, fe, gz, h, 
Qo, F2), such that ¢L$ = T(A.).t Moreover, if A, is in some closed class, 
C, then A, ts in C. 


Proof: Choose q2 to be a symbol not in S,, and let S,; = 8S, U {q}. 
F, = F,if q, is notin F,; F, = F, U {q2} otherwise. Define f, and g, 
as follows: 


(7) (foes = 
(22) (fo) = (fide for q in S,. 


For all ain I — {¢, $} and all min M: 


(272) g2(Q, a, m) = 91 (9, a, m), for q in S, : 
(tv) G2(qo » a, m) = 91(% >, m). 
(v) g2(Q, $, m) = 9, for q in Sp. 
(vt) go(q2, ¢,m) = {(q2, +1)}. 
(vit) go(q, ¢, m) = ¢ for gin S,. 


Let w, of length n = 1 be in (J — {¢, $})*. By rules (2) and (v1), 
we have (q., ¢w$, 0, 1) |Z, (qe, ¢w$, 1, 1). By rules (72) and (2v), it 
follows that (q. , ¢w$, 1, 1) |z, (g, ¢w$, j, 2) if and only if (q., w, 1, 1) 
lz, (gq, w, j, 2). Then, by induction on the number of moves made, 
starting with one move, we.see that (q., ¢w$, 1, 1) lz, (¢, ¢w$, Jj, 2) 
if and only if (¢,, w, 1, 1) |Z, (q, w, 3, 2), for j S n. Finally, by rules 
(1) and (722), if (g, w, n, 2) |Z (*), then it must be that g,(qg, a, m) con- 








} Strictly speaking we require also that the balloon control function be defined 
for t and the new state if configuration (*) is to be entered. 

t Recall that by convention, ¢ and $ are in every set of input symbols, even if 
endmarkers are not used. 
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tains (p, +1) for some p in F, , where h(z) = m and ais the nth symbol 
: w. Also, fp(2) is defined, so (gq, ¢w$, n, 7) |z, (p, ¢w$, n + 1, 7), where 

= f,(¢). Thus, if w is in T(A,), then nS is in 7'(A,). 

“Tf (qa, ¢w$, 0, 1) |a, (p, ¢w$, n + 1, 2), where 7p is in F,, from rule 
(v) we see that A, could not have tape a move while scanning the $. 
Thus, for some q in S and integer, k, (qi, ¢w$, 0, 1) |z, (¢, ¢w$, n, k) 
lz, (p, ¢w$, n+ 1, 2). From the previous paragraph, we know that 
(q., w, 1, 1) e, @ w, n, b) and (, w, », b) |z, @). Thus, if ¢v$ is in 
T(A,), then w is in 7(A,). 

One detail remains, concerning the case w = «. If ¢ is in T(A,), 
then q, is in F,. Thus, g2 is in F.. By rule (vz), (q2, ¢$, 0, 1) a (qo 
¢$, 1, 1), so ¢$ is in 7(A.). If ¢ is not in 7T(A),, then q, is not in F, 
and g2 is not in f,. By rules (v) and (vz), only one move of Az is pos- 
sible, and Az does not accept ¢$. We conclude that 7(A.) = ¢L$. 
It is clear from Lemma 2 that Az is in class C. 


Theorem 8: Let A, = (81,1, M, fi, 91,2, , Fi) be a INBA, in some 
closed class, C, with L, = T(A,). Let L, = {w| ¢w$ is in L,}. Then 
there is a NBA, Az = (S2, 1, M, fo, g2, h, [q., 1], F2) in class C, with 
T(A.) = L. 











Proof: We will place in S; all symbols of the form [q, 2], where g is in 
S, andi = 1, 2, 3, or 4. If ¢$ isin L,, then F, = {[¢,, 1]} U {lg 4] | ¢ 
in S,}. If ¢$ is not in L,, F. = {[g, 4] | q in S,}.} Define f. and g. as 
follows: 


(t) Gedtecr = (fi)¢ for all g in 8S, andi = 1, 2, 3, 4. 
For all min M, qin S, andainJ — {¢, $}: 


(it) go([g, 1], a, m) = {([p, 1], 0) | (p, 0) isin g,(g, ¢, m)} U {([p, 2], 0) 
| (p, +1) is in g,(q, ¢, m)}. 

(zit) go([gq, 2], a, m) = {([p, 2], d) | (p, d) is in gig, a, m), d = 0 
or +1} U {([p, 3], 0) | (p, +1) isin g,(g, a, m)} U {([p, 4], +1) | (@, + 1) 
is in gi(q, a, m) and p is in Fj}. 

(v) ga(lg, 3], a, m) = {([p, 3], 0) | (p, 0) is in g.(q, $, m) and p is 
not in F',} U {([p, 4], +1) | (p, 0) is in g,(q, $, m) and p is in Fy}. 

(v) go([q, 4,], a, m) = ¢ for any ain J, including ¢ and $. 


Intuitively, when the second component of state of A, is 1, A. imagines 
it is reading ¢ on its input. If the second component is 3, it imagines 
} Obviously, it may not be possible to tell whether ¢$ is in ZL: . In that case, the 
procedure given here can be thought of as defining two automata, one of which 


accepts L,. Computation of Az from A; is not effective, but this fact will not alter 
our theoretical results. 
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it is reading $. If the second component is 2, it uses the symbol actually 
scanned. A second component of 4 indicates an accepting state. 

Formally, let w be in I — {¢, $})*, of fracee n = 1. From rules () 
and (22), we see, that ([q., 1], w, 1, 1) = p, lj, w, 1, a if and onlyif 
(na; gw§, 0, 1) . (p, cw, 0, 2). Also, ([q, 1], w, 1, i,) kk. p, 2], w, 1, z2) 
if and only if @, ew, 0, 1) ke (p, ¢w$, 1, to). * 

Next, by rules (2) and (221), ({[p P; 2); w, 1, 1;) [a5 (Ip, 2], Ww, nN, t2) if 
and only if (q, ¢wS, 1, 11) ie (p, ew, n, . Also, ([a, 2), Ww, n, 1) ve 
([ ? 3], Ww, nN, 12) if and only if (q, gw, n, 11) las (p, ¢w$, n fs 1, to). 
In addition, ({q, 2], w, n, 4,) [z, (*) if and only if (g, ¢w$, n, 4) [z, @ 
¢w$, n + 1, 7,) for some p in F,. For in the latter case, ([p, 4], +1) 
will be in g2(q, a, m), where m = h(2,) and a is the nth symbol of w. 
Note that (f2);,,4; is defined exactly when (f,), is defined. 

Third, by rules (z) and (2), (Ig, 3], w, n, 21) = ( [p, 3], w, n, to) if 
and only if (qg, ¢w$, n + 1, 7;) 5 a. (D, dw$, n + 1, i) by a sequence 
of moves such that A, does not enter a state of F,. Also, ([g, 3], w, ”, 23) 
Iz, (*) if and only if (g, ¢w$, n + 1, 4) [z, (», ¢wS, » + 1, 4») for some 
pin F,. 

Putting together the results above, we have that for w, of length 
n= 1, ((q1, 11, », 1, 1) |G, (* if and only if (q , ¢w$, 0, 1) [a (p, wv, 
n+ 1, 7) for some p in F, and integer, 7. Also, [g,, 1] is in F, if and 
only if ¢$ is in L,. Thus, ¢ is in 7(A,) exactly when ¢$ is in 7'(A,). 
We conclude 7(A,) = L,. It is again straightforward to see that 
A, is in class C. 

We say a closed class of automata is recursive if there is an algorithm 
to determine if any given word is in T(A), for any automaton A in 
the class. We have a corollary to Theorem 8. 


Corollary: If C ts recursive, then for L, and Lz as in Theorem 8, we can 
effectively find an automaton, Az, with T(A2) = Lz, from the specification 
for A,. 


Proof: It is sufficient to note that in this case, we can effectively de- 
termine if ¢$ is in L, , hence we can effectively find A.. 
We can now prove a series of closure properties of the INBA. 


Theorem 9: Let A, and A, be 1NBA in some closed class, C. Let Ly = 
T(A,) and L, = T(A2). Then there exists A, in C with T(A,) = LiL. = 
{w|w = uv and wis in Z,, vin Ly}. 

Proof: Let A, = (81, J, Mi, fi, 91, 1, G%, Fi) and Az = (S2, J, 
Me, fe, 92, he, G2, F2). By Lemma 2, we can assume that S, and 
S. are disjoint, and L, and LZ» are contained in (I — {¢, $})*. Define 
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A; = (Ss , I, M;, fs, 93; hs, In, 1], F,), where hs = Ay-he, M; 
M,:M,, and 8; = 8, U {qs} U {lq, 27] | qin S,, 7 = 1 or 2}, where 
gs is a new symbol. 7(A;) will not be L,L., but rather {w|w = uw, u 
and v ¥ ¢, win L,, vin L,}. We define f, and g; as follows: 


()  (fs)te.cr = Cfi)q if gis in S,, 7 = lor 2. 
(71) (fs)e = (fe)a if qd is in So. 
(tit) (fsa, = a”. 


For allain I — {¢, $} and min M;, with k the largest element in M, : 


(iv) If qgisin S, but not in F, , then g;([q, 2], a, m) = {([p, 1], 0) | (p, 0) 
is in n(@ a, oi(k, m))} U {([p, 2], +1) | (py, +1) isin gi(q, a, o1(k, m))}, 
t = 1lor2. 

(v) If gisin F, , then gs([q, 1], a, m) is defined as in (7v). ga([g, 2], a, m) = 
{([p, 1], 0) | (p, 0) is in gig, a, ork, m))} U {([p, 2], +1) | @, +1) 
is in 91(9, a, oi(k, m))} U {(@s ’ 0)}. 

(vt) 93(Ys » a, m) = 92(Qo » a, o2(k, m)). 

(vit) gs(q, a, m) = g2(q, a, o2(k, m)) for all qin S,. 


Note that if A; is in a state of the form [g, 2], then on its last move, 
its input head moved right. If in a state of the form [g, 1], the input 
head did not move right on the previous move. When A; has just 
moved right and entered an accepting state, according to rule (v), 
it has the option of continuing to simulate A, or going to state q;, 
resetting the balloon to state 1, and then simulating A2. 

Formally, from rules (iv) and (v), it is straightforward to show 
that ([q., 1, w, Js 1) =. ( [p, 2], w, j, 2) for j = 2 and =: in F, if and only 
if (qm, u, 1, 1) i" 4, (*), where w is the first 7 — 1 symbols of w. Certainly, 
if and only if p is in F, does ([p, 2], w, j, 2) [z, (@s, w, J, 1), by rules 
(itz) and (v). Finally, by rules (iz), (vt) and (vit), oo w, j, 1) [z7, © 
if and only if (q., v, 1, 1) [5 a, (*), where v is the jth fad subsequent 
symbols of w. 

Thus, 7(A;) = {w|w = w with u, v ¥ 6 win L,, v in L,}. Clearly, 
A; is in C’. Suppose e is in L,. By Theorem 5, redone for the INBA, 
and Theorems 7 and 8, there exists A, in C with T(A,) = T(As) U In. 
If eis not in L,, let Ay = Az. If € isin L2, there exists A; in C with 
T(A;) = T(A, U L,. If is not in L,, let A; = As. Finally, if « 
is in both Z, and L,, by Corollary 2 to Theorem 6, redone for the 
INBA, and Theorems 7 and 8, there exists A, in C with T(A.) = 
T(A;) U {e}. Otherwise, let 4g = A;. In any case, it should be clear 
that T(A,) = L,Lp. 


Corollary: If C is recursive, A, can be effectively found. 
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Proof: Immediate from the corollary to Theorem 8. 


Theorem 10: Let A be a INBA in class C, with T(A) = L. Then there 
is an automaton, A;, in C, with T(A;) = L* = {eh ULULLU 
LLL WU :--., 


Proof: By Corollary 1 to Theorem 6, redone for NBA, and Theorems 
7 and 8, there exists A, in C with 7(A,) = L, = L— {e}. Note that 
L* = L* , Moreover, since ¢ is not in 7'(A,), we can always effectively 
find A,. Let A, = (8,, 7, M, fi, 91, 2, 1, Fi). We will construct 
Ay = (82,1, M, fe, g2, hy 92, Fs) in C, with 7(A2) = L¥ — {e}. Define 
So = {qo} U {[g, 7] | qin S,,7 = 1 or 2}, where q is a symbol not 
in S,. Let F. = {[g, 2] | qin F,}. Define f, and g2 as follows: 


(2) (fo)taay = Ga for gin 8,,7 = 1 or 2. 

(22) (fea = a. 

For all ain I. — {¢, $} and min M: 

(iii) If q isin S, — F,, go((q, 2], a, m) = {([p, 1], 0) | (p, 0) is in 
gi(q, a, m)} U {([p, 2], +1) | (p, +1) is in g,(g, a, m)}, 7 = Lor 2. 

(iv) If gis in F,, go([g, 1], a, m) is as in rule (772). g2([g, 2], a, m) = 
{([p, 1], 0) | (p, 0) is in gig, a, m)} U {(lp, 2], +1) | (p, +1) is in 
gi(q, a, m)} U {(@2, 0)}. 

(v) g2(Q2, a, m) = gilqi, a, m). 


The significance of 1 and 2 in the second component of state 
of S, is as in Theorem 9. A, simulates A,, but when in an accepting 
state, Just having moved its input head right, has the ie of entering 
state g.. Thus, it is easy to see that (q@, w, Fp dla Dy 21, Wedag hs 
with jo > j; if and only if (q,, u, 1, 1) |z, (4), where wu is aymbols hi 
through 7, — 1 of w. Exactly when p is in F', do we have ([p, 2], w, jo , 2) 
Iz, (2, W, jo, 1). Thus, if and only if (q., w, 1, 1) as (*), can w be 
written in the form uyu, --- u,,k 2 1, whereu;,1 S71 k,isinl,. 

Thus, T7(A,) = L* — {e} = L* — {e}. Surely, A, is in class C. 
By an argument used in Theorem 9, we can find A; in C, with T(A;) = 
T(A.) U {e} = L*. Moreover, since ¢ is in 7(A;), we can always 
effectively find A;. 








Theorem 11: Let A = (S, I, M, f, g, h, Wo, F) be a INBA 1n class C. 
Let G = (K, I — {¢, $}, I, — {¢, $}, 6, d, po) be ag.s.m. We assume 
for convenience that I, is a finite alphabet containing ¢ and $. Let L = 
{w | ¢w$ isin T(A)}. Then there is an automaton, A, in C, with T(A,) = 
¢L,$ and L, = G(L). 


UNIFIED THEORY OF AUTOMATA 1823 


Proof: Let A, = (81,11, M, fi, 91, h, a, £1). The proof is represented 
in Fig. 4. The finite control of A, contains a generator which non- 
deterministically generates symbols in J — {¢, $}. These symbols 
are processed by G, and compared with the input. The input head 
rests on the leftmost uncompared symbol. A, also uses the generated 
symbols as inputs to A, which it simulates. A, accepts if A accepts 
while A, is scanning $ on the input, with no symbols left to compare. 

We define S, = {[q, p, a, u, 2] | q in S, Pp in K,ain I — {$} 
ora = 67 = 1,2or3 and win (J — {¢, $})*, but | wu] < max (| A(p, a) | 
for pin K, ain J.}. The first component keeps track of the state of A, 
the second, of the state of G. The third component holds the symbol 
generated, and the fourth, the output of G for that symbol and the 
current state of G. The last component is 1 usually. It is 2 when A 
would have just moved its input head right, and it is 8 when A would 
be scanning ¢ or §. 

Define F, = {[q, p, « ¢, 7}] qin F, 7 = lor 3}. Also, gq; = [qo, Do, 
€, €, 3}. 

Define f, by: 


(t) (fs)te.n.a.u.i] = fa if either a ¥ cori = 38. 
(it) (f1)tq.r.0.u.i) = @” otherwise. 


For all min M, bin J, — {¢, $}, qin S and p in K, define: 






FINITE 
CONTROL. 






FINITE 
CONTROL 
OFA 


SIMULATED INPUT 










Fig. 4— Automaton Ai. 
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(t27) gi(lg, po, € €, 3], ¢, m) contains ([s, po, €, €, 3], 0) if (s, 0) is 
in g(q, ¢, m). 

(iv) 9:([¢, Po, € € 3], ¢, m) contains ([s, po, ¢, €, 2], +1) if (s, + 1) 
is in g(q, ¢, m). (A; Gurulates A with ¢ as input.) 

(v) g.([¢, p, & & 1], 6, m) contains {([s, p,, a, u, 1], 0) | for any a 
in I a {¢, $}, if (s, 0) is in 9% a, m), Pi = 5(p, a), and u>= A(p, a)}. 
Likewise, if b = $. 

(vt) 9. ((a, DP, €& &, if; b, m) contains {([s, Pi, a, U, 2], 0) | for a in 

— {¢, $}, if (s +1) isin gq, a, m), p, = 6(p, a), and u = X(p, a)}. 
Likewise, if b = $. (The random generator generates symbol a, which 
is stored in the third component. \(p, a) is stored in the fourth. The 
new state of A, with a as input symbol is stored in the first component, 
and the new state of G in the second. If A would immediately move 
its input head right, the fifth component is 2. A 2 there tells A, it is 
finished with symbol a. Otherwise, a 1 is placed in the fifth component.) 

(vit) g.([g, p, a, u, 1], b, m) includes {([s, p, a, u, 1], 0) | (s, 0) is in 
g(q, a, m)}. Likewise, if b = $. 

(viet) gi(lg, p, a, u, 1], b, m) includes {([s, p, a, u, 2], 0) | (s, +1) 
is in g(q, a, m)}. Likewise, if b = $. (A, simulates a nove of A. The 
fifth component of A,’s state becomes 2 if the input head of A moves 
right.) 

(tx) gi(la, p, a, u, 2], b, m) = {([g, p, ¢ u, 1], 0)}. Likewise, if b = $. 
(Remove a as third component and set fifth component to 1.) 

(x) For any u, gi(la, p, ¢ bu, 1], b, m) = {([a, p, ¢ u, 1, +1}. 

(xt) gi([g, p, ¢, bu, 1], b,, m) = ¢ for b, ¥ b. (A, compares its fourth 
component with the input.) 

(xit) gi(lg, p, ¢, «€ 1], $, m) contains ([s, p, ¢, ¢, 3], 0) if g(@, $, m) 
contains (s, 0). 

(xitt) 91 (4, P, & &, 3], $, m) contains ([s, P, & & 3], 0) if 94 $, m) 
contains (s, 0). 


We will state a series of intermediate results that follow directly 
from the rules given. We assume that G(w) = z. 

I. By rules (¢) and (#i7): ([90, Do, € &,3], ¢x$, 0, 1) (la, Pos & & 3], 
ée$, 0, 4) if and only if (qo, ¢w$, 0, 1) [5 (a, ¢w§, 0, 2). 

II. By rules (i), (72), (w), and (7x): ([g, Po> €& &; 3], éx$, 0, 1;) las 
([s, Po; ¢, €, 2], ¢x$, 1, 12) va ([s, Po,» € €; 1], éx$, 1, 12) if and only if 
q, ¢w$, 0, 11) vt (s, ¢w§, 1, 1g). * 

III. By rules (2), (v) and (viz): ([g, p, ¢ ¢, 1, ¢x$, 3, 41) la, (ls, 7, @, u, UI, 
¢x$, j, t2) by a sequence of moves for which the third component of 
state never becomes «, or the fifth = 2, if and only if (q, ¢w$, k, 71) 
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= (s, ¢w$, k, 72), where the kth symbol of w is a, 6(p, a) = r and 
A(p, a) = u. 

a From III, and rules (2), (vz) and (viiz): ((¢, p, ¢ ¢ 1], ¢x$, 4, a1) 
lz, ([s, 7, a, u, 2], ¢x$, 7, 22), by a sequence of moves in which the third 
component of state never becomes « if and only if (q, ¢w$, k, 1) I= : 
(s, ¢w$, & + 1, 2) by a sequence of moves in which the input head 
remains at sesition k until the last move is made. Here, again, the 
kth symbol of w is a, 6(p, a) = r and X(p, a) = u. 

V. From rule (zz), ([g, p, a, u, 2], ¢x$, i, 2) Iz, (Ig p, & % 11, ¢2$, 3, 2) 
for any a ¥ ¢, and if the fifth sorpenenta is 2, no other move is possible. 

VI. From rule (2), (1g, P, 6% 1, $28, x4) EE, (a, B, & 6 1, £28, fn, 4) 
by a sequence of moves for hich the third component of state remains 
e, if and only if symbols 7, through j. — 1 of xz Ha U. 

VII. Combining IV, V, and VI: ({g, p, ¢ «¢, 1), ¢r$, j1, 71) Is re 
é, €, 1], ¢x$, Jo , 72) by a sequence of moves in ae the third ee 
changes from ¢ to a symbol in J — {¢, $} back to e only once, if and 
only if for some ain J — {¢, $} and win (J, — {¢, $})*, we have 

(a) d(p, a) = 7; 

(d) A(p, a) = U; 

(c) Symbols j, through jo — lof x areu; 

(d) (q, ¢w$, k, 7,) 5 a (s, ¢w$, k + 1, 72) by a sequence of moves in 
which A’s input head remains stationary until the last move, and 
a is the kth symbol of w. 

Note that j, = j. = n + 1 is not prohibited. 

VIII. Using I, II, and VII iterated: ([¢0, po, & € 3], ¢x$, 0, 1) hi Pe 
(lq Dp, 6 & 1] 1, éx$, n + 1, 7), where |x| = n, if and anly if, for some 
win (I — fd. $})*, of length k: 

(a) (ao; ae 0, 1) = a1 (q, ¢w$, & + 1, 7) by a sequence of moves 
in which A’s input head does not reach $ until the last move; 

(6) Spo, w) = D; 

(c) (po, W) = =. 

IX. Directly from VIII, A, accepts ¢x$ by entering a state [q, 7, €, ¢, 1], 
where gq is in F, if and only if there is a w as in VIII such that A accepts 
¢w$ by entering state g on the same move on which A first moves 
its input head to $. 

X. From rules (x77) and (zizz), ([¢, p, € ¢& 1], dw$, 2 + 1, 7,) = Zo Ss D3 
e, ¢, 3], ¢x$, n + 1, 22), where |x| = n, by a sequence of moves in 
which the third component of state remains «, if and only if (g, ¢w$, 
k + 1, 7,) = (s, ¢w$, k + 1, 2), where k = | w |. 

XI. From VIII and X, A, accepts ¢x$ by entering a state [s, p, €, ¢, 3] 
if and only if A accepts ¢w$, where \(p., w) = 2, by a sequence of 
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moves in which A enters state s while its input head remains scanning $. 
XII. Finally, from IX and XI, we have that A, accepts ¢x$ if and 

only if A accepts ¢w$, where \(p,, w) = x. Thus, 7(A,) = ¢G(L)$. 
We need only add that A, is, by definition, in class C. 


Corollary: If L = T(A) for some INBA, A, in class C, and G is a g.s.m., 
then there is an automaton A, in class C, for which T(A,) = G(LZ). 
If class C ts recursive, we can effectively find A, . 


Proof: Direct from Theorems 7 and 8. 


Theorem 12: Let L = T(A) for some 1NBA, A, in class C. Let k be 
a regular set. Then there is an automaton, A;, in class C, with T(A3) = 
L/R = {w| for some x in R, wx is in L}. 


Proof: Let A = (S, I, M, f, 9, h, @, F). We can surely find a finite 
automaton, A, = (K, I — {¢, $}, 5, po, F:) accepting RM (I — {¢, $})*. 
Intuitively, A; will simulate A, but will always have the additional 
choice of guessing that it has seen w. It then nondeterministically 
chooses the symbols of x, continuing to simulate A. We will construct 
A, to accept L/R — {e}. The reader can easily see how e can be added 
to the set accepted by A2. 

Formally, let A. = (S2, J, M, fe, 92,h, Qo, F2). Ss = SU {lIg, p, al | ¢ 
in S, pin K, ain — {¢, $} ora = e}. F. = {[q, p, | | qin F, pin Fj}. 
Define f, and gz as follows: 


(2) (foe = fa for gin S. 

(7) (fo)te.>.a1 = fz for allgin S, pin K,ainJ — {¢, $} ora = «. 
For all b in J — {¢, $}, min M: 

(722) go(q, 6, m) contains (s, d) if g(q, 6b, m) contains (s, d). 

(iv) g2(q, b, m) contains ([s, Po, €], +1) if g(q, b, m) contains (s, +1). 

(v) go(q, b, m) contains ([s, p, a], 0) if g(g, b, m) contains (s, +1) 
and 6(po, a) = p, for any ain J — {¢, $}. 

(vt) go([q, p, a], b, m) contains ([s, p, a], 0) if g(q, a, m) contains (s, 0). 

(viz) g2([q, p, a], b, m) contains ([s, r, a,], 0) for any a, in I — {¢, $} 
if g(qg, a, m) contains (s, +1) and 6(p, a,) = °. 

(vitt) go([q, p, al, b, m) contains ([s, p, «], +1) if g(g, a, m) contains 
(s, +1). 


Note that no moves are possible if the third component of A,’s 
state is e. From rules (2) and, (777), we see that (q., w, 1, 1) |z, (@, w, J, 2) 
if and only if (q., w, 1, 1) |; @ w, j, 2). Let w be of length n. By rules 
(at) and (iv), (q, w, n, 2) lz, (*) if and only if p, is in F, (e., € is in R) 
and (q, w, n, i) [z (*). 
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By rules (iz) and (v), (q, w, n, 7) |Z, ([s, p, a], w, ”, 2) if and only if 
(q, wz, 0, 41) |Z (s, wx, n + 1, 22), where the first symbol of x is a and 
5(po , a) = p.t Let | x | = k. Then, by rules (72), (v7), and (v77), ((, p, al, 
w, n, 11) |x, ([s, 7, ai], w, n, 22) if and only if (q, wz, n + 1, 2) |; (s, wa, 
n+ k, 2), 6(p, x) = r, and x ends with a,. 

By rules (77) and (vizz), ([g, p, a], w, n, 7) |z, (*) if and only if (q, wa, 
n+ k, 2) |z (4), where the last symbol of z is a, and x is in R. 

Putting the above together, we see that (qo, w, 1, 1) |, (*) without 
ever entering a state of the form [q, p, a], a # € if and only if ¢ is in 
R and (q, w, 1, 1) |x (*). Also (qo, w, 1, 1) |a, @), entering a state 
[q, p, a], a # ein so doing, if and only if for some x in (I — {¢, $})*, 
(q,, wx, 1, 1) |g (*) and wz isin R — {e}. If A, is modified to accept e, 
provided ¢ is in L/R, then the resulting device is A; . 








VIII. CONCLUSIONS 


We have considered four types of general automata, and defined 
closed classes for each of these four types. We have shown certain 
common operations to preserve these classes, in the sense that if a 
language, L, is accepted by an automaton in the class, and Ly is the 
result of the operation applied to L, then L, is accepted by some 
automaton in the class. 

The classes model many of the common devices which have been 
heretofore considered in the literature, such as stack automata and 
counter machines. It seems as though they could be expected to 
model any future class of automata which are defined solely by the 
ways in which their infinite storage can be locally manipulated. The 
classes do not model such things as linear bounded automata or 
time/tape complexity classes of Turing machines, intuitively because. 
such automata are defined by global restrictions on memory. (I.e., one 
may use “this much” memory, and no more.) 

In Table I, we list the types of balloon automata and the opera- 
tions considered. A check indicates that the operation preserves 
membership in a closed class of automata. 

It is hoped that when models of automata are proposed in the 
future, theorists will find it efficient to show that their model is 
equivalent to a closed class of balloon automata. They will then have 
a variety of standard theorems already proven for them. 


+ Note, however, that the first symbol of x does not affect the operation of A 
at this step. 
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TABLE [ 
2DBA 2NBA 1DBA 1NBA 






























































Reversal / «“/ 
Intersection = == =| a an cars 
eee a er ee 
Union , Y, 
Intersection with regular set V/ JV JV VJ 
Coneatenation (-) J 
Kleene closure (*) ies eee) (ae: Hee’ ae 
g.s.m. forward oat eer PG a a 
Quotient with regular set (/) a i a ae 


IX. FUTURE PROBLEMS 


There are various theorems about automata that have not been 
reflected in the results on balloon automata. For example, one-way 
deterministic pushdown automata are closed under complement. It is 
probably true that all common types of one-way or two-way de- 
terministic automata are closed under complement, although proofs 
have not been published in all cases. Likewise, many one-way de- 
terministic devices are closed under quotient with a regular set. Most 
one-way nondeterministic devices seem to be closed under reversal, and 
so on. 

We therefore propose as an interesting and worthwhile problem, the 
question of putting additional restrictions on closed classes of balloon 
automata such that some or all of these results can be proven. Of 
course, the conditions must be liberal enough so that the usual auto- 
mata are still modeled. 

Second, it would be useful to have a model, like the balloon auto- 
maton, which could describe, as closed classes, such things as linear 
bounded automata and computational complexity classes. The prop- 
erties of these classes deserve some treatment, and an approach sim- 
ilar to the one taken here might be a reasonable one. 

It is hoped that the methods we have used to prove certain 
theorems plus the fact that we could not prove some others will shed 
some light on why some theorems are hard to prove, or visualize, while 
others are easy. Specifically, we have an indication as to why certain 
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theorems seem easier to prove for nondeterministic devices than 
deterministic. 
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Extensions to the Analysis of 
Regenerative Repeaters with 


Quantized Feedback 


By M. K. SIMON 
(Manuscript received May 23, 1967) 


The functional iterative approach given by Zador for calculating the 
average bit error probability in a regenerative repeater with quantized 
feedback is extended to the vector case. For a channel with a rational frac- 
tion transfer function, the vector extension permits us at least formally 
to deal with the following practical conditions: 


(t) The pulse transmission plan is described by an m-ary alphabet 
with independent digits. 
(it) Perfect and imperfect low-frequency tail cancellation cases are con- 
sidered. 
(iii) High-frequency signal shaping and its interaction with the pre- 
dominantly low-frequency tail are taken into account. 


Expressions for error probability on the kth digit are derived in terms of 
the kth vector iterate of a known function. The restriction to independent 
noise samples is also removed. The resulting expression for kth bit error 
probability is then derived from an operational iteration procedure which 
acts on the k + 1 dimensional joint distribution of the noise samples. 


I. INTRODUCTION 


In the design of digital communication links, various reasons exist 
for the removal of low-frequency components during or prior to trans- 
mission of a pulse train. In the case of vestigial sideband (VSB) 
modulation? of data over voice-frequency channels, the de and low- 
frequency signal components are removed at the transmitter before 
modulation and carrier reinsertion. This is required to insure satis- 
factory carrier recovery at the receiver for relatively low transmitted 
carrier power. In the T-1 Carrier System,” the loss of low-frequency 
information results from transformer coupling of an unbalanced 
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repeater to the balanced line. In either event, the effect of low-fre- 
quency suppression is to cause the positive impulse response of the 
overall equalized medium to exhibit an undershoot which gives inter- 
symbol interference. 

One means of reducing the effect of low-frequency suppression in a 
regenerative repeater is to feed back a signal in an attempt to cancel 
the long transient tail. This method of compensation has been called 
quantized feedback and its use dates back to the 1920’s (as noted by 
Bennett?). We assume that the reader is familiar with Bennett’s ex- 
cellent expository paper. Until recently, analysis of the effects of 
quantized feedback on average bit error probability in a noisy en- 
vironment has received essentially no attention. The first to examine 
this problem were Anderson, Gerrish, and Salz* who considered the 
polar binary case, neglecting signal shaping and assuming perfect 
matching of the feedback cancellation signal to the input signal tail. 
They have obtained results, with the aid of the computer, that have 
provided insight into the problem. In addition, they have exposed com- 
putational difficulties involved in grinding out numerical results for 
any given set of system parameters. 

A more analytical approach to the basic problem is found in Zador?® 
who used the theory of generalized random jump processes® to obtain 
an iterative procedure for computing error probability. Unfortunately, 
the class of physical systems that can be handled by Zador’s approach 
as originally stated is quite restrictive in the following sense (see 
Fig. 1): 


(1) The transmitted message sequence is composed of independent 
binary digits. 

(it) The low-frequency behavior of the channel as represented by 
G(s) is dominated by a single pole. 

(iz) G(s) and H(s) are exact complements of each other so that 
perfect feedback tail cancellation is achieved. 

(iv) The time dispersion of the transmitted pulses caused by the 
medium, C(s), with or without equalization H(s) is strictly limited to 
two pulse intervals. 

(v) The noise samples at the input to the threshold detectors are 
assumed independent. 


It is our intention here to remove some of the above restrictions. In 
particular, we extend Zador’s approach along the following lines: 


(1) By allowing a multilevel threshold device as a regenerator, the 
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Fig. 1— Block diagram of reconstructive repeater with quantized feedback. 


allowable pulse transmission plan is extended to include m-ary alpha- 
bets with independent digits. (The ternary case is treated in detail.) 

(1) The high- and low-frequency behaviors of the channel may be 
individually characterized by rational functions. The implication of 
this is twofold. First, the predominantly low-frequency tail is now 
described by several exponentials. Secondly, the impulse response of 
the overall equalized medium C'(s) E(s) is not restricted to be time- 
limited. 

(122) The restriction to perfect tail cancellation is removed to allow 
for imperfections in the forward and/or feedback paths. 

(tv) The more realistic case of correlated noise samples is examined. 


Extensions (2), (7), and (wz) are possible only through a vector ap- 
proach based on Zador’s original iteration scheme. The assumption of 
a nonflat noise spectrum as in (2v) leads to an operational iteration 
procedure for calculating bit error probability. It is to be emphasized 
that the question of computational procedures, which even in the sim- 
ple binary case was a formidable task, grows considerably in com- 
plexity with the degree of generality assumed. 

The generalizations listed above will be treated one at a time so as 
to demonstrate individually the necessary changes in Zador’s original 
formulation. A review of his model is given in Section II. 

Section III assumes an unrestricted ternary message sequence to- 
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gether with the remaining restrictions as imposed by Zador. An ap- 
plication of the results is given for a particular high-frequency behavior 
of the system. The response of the high-frequency portion of the chan- 
nel, C(s) E(s), to a transmitted rectangular pulse is assumed triangular 
in shape and time-limited to two pulse intervals. 

Section IV derives the general expression for error probability when 
the overall channel, Y(s) = C(s) E(s) G(s) is assumed to be charac- 
terized by a rational function. The feedback network, H(s), is de- 
signed to cancel only the low-frequency poles, i.e., those of G(s). The 
special case of a binary input format is treated in detail. 

Section V modifies the results of Section III by including the case 
of imperfect match of the G(s) and H(s) characteristics. 

Section VI begins with Zador’s original assumptions on the signal- 
ing format, channel, and feedback network characteristics, but re- 
moves the restriction of independent noise samples. An expression for 
kth bit error probability is derived from an operational iteration pro- 
cedure which acts on the & + 1 dimensional joint distribution of the 
noise samples. The analogy between this scheme and the functional 
iteration proposed by Zador for the uncorrelated noise case is demon- 
strated. 


II. REVIEW OF ZADOR’S MODEL 


We begin with a brief review of Zador’s mathematical assumptions 
and emphasize their physical significance. Consider once again the 
repeater-to-repeater transmission link illustrated in Fig. 1. The out- 
put of the nth repeater at time r7’ is a binary rectangular pulse* 
d,p(t—rT) where 


pH) =p |t|st 
= Q, [be | dy 


d, = +1, and 1/T is the pulse rate of the system. Zador does not ex- 
plicitly describe the high-frequency behavior of the system. The class 
of channels that satisfies his underlying assumptions is discussed below. 
Let the response of C(s)E(s) to the pulse p(é), denoted by z(t), be time 
limited to 27, and zero at its end points. It is understood that in practice 
these conditions are usually met only approximately. Then, by passing 
2(t) through a single pole high-pass filter, G(s), the part of the resulting 


* Zador assumes +1 impulses as repeater output. As we shall see, in the sam- 
pled systems we consider, this modification has no effect on the ensuing analysis. 
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response, g(£), for £ = 27 is dominated by a single exponential. If s(t) 
is sampled at time ¢ = T and held until t = T + &, then by employing 
an ideal slicer element as a threshold detector a unit rectangular output 
pulse, b(t) is regenerated. Furthermore, by passing this pulse through 
H(s), the response tail of s(t) for t = 27’ may be exactly cancelled in 
the absence of noise and circuit imperfections. These observations are 
illustrated in Fig. 2 for a triangular pulse shape z(t). The response of 
H(s) to the regenerator output pulse is denoted by h(t). Turning now 
to a sample notation, let g, , h, , and b, represent the values of g(t), h(é), 
and b(t), respectively, at time (kK + 1)T, k = 0, 1, 2. Then, from Fig. 
2, it is obvious that the following conditions must hold, in general, in- 
dependent of the waveshape of 2(¢) within the 27 interval: 


(2) Jo > 0, ho =— 0 
(12) h; +9; = 0 += 1,2,-- 


(771) gi = 'Gi-1 122 


h(t)=-g (t) 
t2 aT 


h(t) —> 





Fig. 2—System pulse responses. 
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where r is related to the single pole, a, of G(s) by r = e7*". Con- 
dition (222) is clear upon noting that the response of a single pole high- 
pass filter to a time limited signal of width 27 has a single exponential 
response for values of t = 27. Statements (2) to (727) as above are 
identical with Zador’s restrictions on the system as reported in Ref. 
5.* The shape of z(t) is solely used in determining the two dependent 
quantities g, and g,. For a triangular z(t) waveshape of unity height 
(Fig. 2) and G(s) = s/s + a, 


ee 
go = ll é ] 

—~ Ly _ erp 
f= on é es 


III. TERNARY PULSE TRANSMISSION 


When considering a ternary system, the only essential modification 
of the model suggested by Zador is an ideal slicer with positive and 
negative pulse detection thresholds set at +a) and —aj, respectively. 

Letting s, denote the total reshaped input at the k + 1th timing 
instant, and c; the feedback voltage at the same instant in time as 
before, the slicing operation is described by 


b, = 1 if gs tm+to 2 ao 
= 0 tf -a<stmta <a 


—1 if S& +mta Ss -a, 


where 
k 


S = Dy on-ds k =0,1,-:-- 


+=0 


k 
G = di hy-ib; &=0,1,-°°- 

+=0 
and 7; is a sample from a stationary noise process n(t) having a fixed 
but arbitrary distribution function N(x), and independent samples. 
The process n(t¢) is actually the result of passing the additive white 
noise process in the system, &(¢), through H(s). We assume, however, 
that the correlation between noise samples introduced by the above 
is small and can be ignored as a first approximation. When this as- 


* Note that Zador also requires gi < for 1 2 1. This restriction is not neces- 
sary although it is often true. 
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sumption is invalid, the method discussed in Section VI must be used. 

It is of prime interest to examine the conditions under which the 
system will operate error-free in the absence of noise. For 0 < a, 
% S%,; 


So + oo = Godo 
thus if, 
a So + oo = go; bg = 1 
= 0, So +c = 0, i= 
= —I], So +& = —-9Jo; b = —I, 
OF “Da = 05-5 
Continuing, in this way k = 1,2, - as Py eae 


k-1 
& +, = Jo + SD (Qn—i a h,-.)d; 
i=0 


= Joly . 


Thus, if b,, = d, form =0,1,- +--+ ,k—1, then b, = d; and the sys- 
tem operates error-free in the absence of noise. 
For the more general case when noise is present, 
k—-1 
& +a = ye guild; — b:) + gods = te + God: , 
binds 
where x, represents the cumulative effect of any and all errors prior 
to time k. 
Letting p and q denote the a priori probabilities of a plus one and 
minus one, respectively, the probability of error on the kth digit p(k) 
can be written as 


p(k) = p Prob {n, +2, S a — go} + q Prob {m +2, > —a: + go} 
+ (1 — p — q) Prob {n, + 2 = ao ;m +2, S —ay}. 


The independence of 7; and 2; allows p(k) to be expressed in terms of 
the noise distribution function N(x) and the distribution function of 
xy, F(x) as follows: 


pt) =p | N(@ = go ~ 2) ax) 
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taf 1 -N@ a ~ 2) dF@) 


+d1-—-p-@q [ [N(—a, — x) + 1 — N(a — 2)] dF (2). 


For the case of a zero mean symmetrical noise distribution and equal 
@ priort probabilities for all input symbols (i.e., p = q = (1 — p — g) it 
is easy to show that the optimum threshold settings are +g,/2 with 


pl) = =a) [1 - NGo/2 — 2) + N(—oo/2 - DAP), 


It now remains to show that the sequence of random variables 
Y,%1,° * * are representative of a random jump process studied in 
Ref. 6 and thus p(k) can be expressed as the kth iterate of a known 
function evaluated at 29 with a finite limit as k > oo. 

Consider, 


k 
Let1 = >» Jr+i—i(d; = b;) 
i=0 


k-1 


= gi(d, — b,) +  rgu_d; a b,) 


= 


Un+1 = gi(d;, vs b,) eee 


There are five possible transition states each of which takes place 
with probability depending on the value of 2, . 


Ifd, = 1, b, = —1, then x,,, = rx, + 2g, with probability p,(z2;,). 
Ifd, = 1, b, =0 


or , then x4; = rx, -+ g, with probability p.(2;,). 
d,=0, bh = —1 
If d, = b, , then 24, = 7x, with probability ps(2;). 
If d, = —1,b, = 0 
or , then 2,4; = rt — g: with probability p,(2,). 
d,=0, b, =1 


Ifd, = —1, b, = 1, then 2,,, = ra, — 2g, with probability p;(a;). 
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The transition probabilities p,(x,), m = 1, 2, ---, 5 are defined by 
piltr) = pN(—a, — go — x) 
P(t.) = pIN(@o — go — tx) — N(—ai — go — 2)] 
=p = QINCH ap =a) 
ps(a.) = 1 — pits) — poe) — pale) — Dox) 
pate) = GIN(ao + go — &%) — N(—ai + go — %)] 
+ (1 —p— @[l — N(@ — 2)] 
Ds(xx) = Q{l — N(@o + go — 1)). 
Note, 


p(t) = [ (eG) + pala) + pile) + pola)] dP). 


Defining U*[f(x)] = pi(x)f(re + 291) + po(x)f(rz + gi) + ps(x)f(rz) 
+ palx)f(rx — gi) + ps(x)frx — 291) 
and denoting the kth iterate of U*[f(x)] by U'[f(x)], 
p(k) = U*[pi(x) + po(z) + pila) + ps(2)] |enseno - 
If A(z) is the limiting distribution of F(a”), then 


Lim p08) = | tp) + role) + r.@) + ps] 2G) 


Lim U"[p,(x) + p(x) a p(x) + ps(x) | leeexes : 


A few remarks are now presented to indicate the obvious extension to 
the m-level (m-ary) pulse transmission scheme. A random Jump proc- 
ess with 2m — 1 transition states will result requiring an iteration func- 
tion U*[f(x)] having 2m — 1 terms. It should be indicated that com- 
putationally the amount of computer storage or operations required to 
evaluate p(k) is of the order (2m — 1)*. 


IV. RATIONAL FUNCTION APPROXIMATIONS OF THE CHANNEL AND 
FEEDBACK NETWORKS 


As the subtitle indicates, we are interested here in studying the 
repeater error performance under the assumption of a rational func- 
tion approximation to the channel and feedback networks. This gen- 
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eralizes the assumptions of Section III in that (7) the tail of the pulse 
response, g(t), is no longer described by a single exponential, and (1) 
the high-frequency behavior of the channcl allows its time response to 
exceed two pulse intervals. To isolate these effects, however, perfect 
feedback tail cancellation is still assumed and we return to a binary 
message format. 

It is convenient to represent the output rectangular pulses of the 
nth repeater as the impulse response of a filter F(s) = (1/s)[1 — e°°"?] 
where é) is the pulse width. Including this filter in the forward path 
of Fig. 1, the overall channel link between repeaters, 7'(s) = F'(s)C(s) 
E(s)G(s), is assumed to be characterized by a rational function as 
follows: 

AM 
T(s) = Go § x eee <a 


IT @ + «) IT @ + 6) 


with its associated impulse response 
AL N 
g(t) == oS Aes zit y Be. 
t=1 t=1 


Note, the impulse response of 7'(s) is the same as the rectangular 
pulse response of Y(s) = C(s) H(s) G(s) and is thus denoted as be- 
fore by g(t). All poles are assumed to be simple, but in general may be 
complex. The terminology used henceforth will refer to the set {a}, 
1=1,2,-- +, M as low-frequency poles and the set {8;},1 = 1, 2, 
- + + Nas high-frequency poles. The inference here is that the £;’s are 
predominantly responsible for signal shaping and the a;’s determine the 
low-frequency cutoff of the channel. 

A low-pass quantized feedback path H(s) is proposed which in the 
absence of noise would provide perfect low-frequency tail cancellation 
at all sampling instants beyond the input pulse peak (the effect of im- 
perfect low-frequency compensation will be discussed in Section V).* 
Thus, if 








qo Sa Oe 2, 


IT + a;) 


where 79 represents the physical delay in the feedback path beyond the 
occurrence of the input pulse peak at & = tnax, then, the response to a 
*Jt is to be emphasized at this point that all of the following is easily gener- 


alized in terms of MacColl’s conception of quantized feedback’ wherein restora- 
tion of both low- and high-frequency signal components is attempted. 
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positive regenerator output pulse at ¢ = fmax would be 


M M 
i) = > Deke ne Se Fore Pe bey 
i=1 i= 


t=1 


Ideally, for perfect low-frequency tail compensation, we desire 


M AL 
DL be aD Ag SQ 
t=1 


t=1 
at all instants tmax + nT, n = 1, 2, +++, where T is the uniform sam- 
pling period. 
Letting h; and g; represent the values of the pulse responses h(¢) 
and g(t), respectively, at the kth sample point the above statements 
may be expressed in brief as follows: 


N 
(1) he to:= Xen t40 


So) 2oee 
Cain = Znli-1,n 
4122 
Con = 0 
at ae 
M 
(iii) ho = Yohan 
n=l 
n = 1, 2 -, M 
iin a Te Mene % : 2 : 
1722 
Poe 


The term e;,, represents the residual intersymbol interference at the 
ith timing instant due to the nth high-frequency pole. To simplify 
what is to follow and at the same time allow a better comparison with 
the previous work of Zador, we introduce the following vector nota- 
tion: 


ry O O oh satu ceuetes oe O 21 @) O one ub Naveen ec 0 

Ot Mgstanewaten 0 Oe By eeeticen ae 0 
R= - 1; Ze 

0 ; 0 





LO ene e eee es ee se oe y Mu e) PO ER BS LAN OS: Be 2 N 
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hor hiya he hs. sere his 
FF No,» hive he,» hs,» Hes his 
ho, ar Riva hear hs, ar tte Near 
Jo,1 Ji 92,1 93,1 caine Jey. 
‘ase Jo,2 91.2 J2,2 73,2 ea Jk ,2 
esos 9i,m+N Qeo,men Qa,m+en °°° Qk, m+n_] 
O Ci C21 C331 gts ei. 
E=- O 1,2 €2,2 3,2 ‘'7* Cx,e 
O €i.n @2,n €3,n *7* Caw 


Using H as an example, the ith row written as a column vector is de- 
noted by hi and the ith column by the vector h; . Also any vector written 
not in bold face is by definition the scalar representing the sum of its 
elements (e.g., h*’ = > *_, h,,;). Finally, we denote the column vector 
obtained by summing all rows of H (i.e., whose ith component is h;) 
by h. All of the above statements’ are equally applied to the matrices 
R, Z, G, and EF. 

In terms of the above, (2), (72), and (77) may now be rewritten as: 


(2) h, = 0, Jo > 0 
(i) h+g=e 

e; = Ze._, i= 2 

(127) h; = &h;-,, i 2 2. 


Some further interpretation of the above statements in terms of 
the actual system operation might prove helpful at this point. (2) 
indicates a positive input pulse peak (g. > 0) and a delay in the feed- 
back path (do = 0). Statements (77) indicate that perfect feedback tail 
cancellation is achieved at the sample points starting with the second 
except for the effects of the high-frequency poles (6:, 62, -** , By). 
In contrast to the previous sections, we do not assume that the high- 
frequency components of the response have died out before the oc- 
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currence of the next input pulse peak. Feedback cancellation of only 
the channel low-frequency poles is described by statements (772). 

For the binary message case, the threshold detector box of Fig. 1 
reduces to a simple ideal slicer element operating between +1 and 
—1 levels. The input sequence {d;} is a random train of +1 and —1 
impulses represented by the vector d with elements d; . 

The total reshaped input at the & + 1th timing instant, s, , and the 
feedback voltage at the same instant, c, , are described by, 


s, = (d * g), k=0.1.2.-- 
c, = (b *h), 


where the kth element of b, b, = sgn {s, + ¢, + n,} is the kth regenerator 
output digit. The notation (a * b), represents the convolution of two 
k + 1 dimensional vectors a and b (i.e., >o* 29 aiby-:). 

Considering first operation in the absence of noise, we see by in- 
spection b) = d, . (This tacitly assumes that no intersymbol interference 
due to precursors is present.) Proceeding as in Zador,’ if b,, = d, for 
m=0,1,---,k—-—1, then 


Sr $C = (d * e); = god, + (d * @)i-1 : 


From this, one concludes that if 
k-1 
Jo > > |e; |, 


then b, = d, and the eye is open. The system will therefore operate 
error-free in the absence of noise for any length input sequence if 


Jo > 2 le; |. 
If all N high-frequency poles (6, 62, -:: , By) have positive residues, 
the above criterion reduces to 


N 
Exon ; 
go > 27 = 2, 


The above implies that the eye is open if the total high-frequency 
contribution at all sample points beyond the firstis smaller than the 
pulse peak. 

More specifically, the values of e,,, and 2, may be related to the 
allowable amount of degradation of the eye. That is, for any eye which 
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is X percent closed. 


Gin xX 
~j—z  100%° 





Mez 


Turning now to the more realistic situation in the presence of noise 


Sp + Cy = ody + (d * 2-1 =e (b * h),-1 . 

Consider subdividing the vector d into two parts d’ and d” in such 
a way as to separate the input digits into two classes corresponding 
to b; = d; and b; ¥ d; respectively. That is, 

= 0 = d; 145 b; # d;}. 
(Obviously d = d’ + d”.) 
Then, using (72), 

u, = (d * TP ies ;V, = —2(d * H"),-4 

Si + Ce = God, + Ue + %, 
= god, + 2X, 


where (d * @),_1 is a vector whose ith component is the convolution of d 
with theith column of G. Again omission of the bold face notation indicates 
summation over all the components and 7’ is the transpose operator. 

The first term in x, denoted by wu, represents intersymbol inter- 
ference due to residual high-frequency tail components irrespective of 
previous decisions. The second term x, again represents the cumulative 
effect of any and all errors prior to time k. 

The expression for error probability on the kth digit is identical 
to that given by Zador, namely, 


pt) =p | N(—go— 2) dF) +4 f [ - No ~ 2) aA). 


The only difference being the nature of the distribution function F’,(z). 
The recursive properties of the intersymbol interference x, are now 
examined. 


Tea. = (A * EH), — 21d” * H’), . 
If b, # d,, then 
Tear = (€; — 2hi)d, + z’(d * E7),_, — 2r7(d” * H’),_, 
= (e, — 2h,)d, + zun +r’ve . 
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If by = di. ) then 
Lear = Cd, + z(d ig E*),-1 = or"(d’”’ bs AY: 
= 6d, + 2 Ur +Ir'Vy . 


Letting a = —2h,, the intersymbol interference sequence 2%, % , 
Z_ , +++ may be expressed as a random jump process,”’° with the following 
transition states: 

If d, = 1 ¥ b,, then with probability p,(«;) 


T. T. 
Yer. = ZU, $e, +r ve +a. 


If d, = 1 = b,, then with probability p.(z,) 


T Tr. 
Tae = Zp fey FT VE 


If d, = 1 ¥ b, , then with probability p,(2,) 


Vrit = ZU = ey. +r'v, = 0, 
If d, = —1 = b,, then with probability p,(a,) 


T. T. 
Levi = ZUy — €, +I'Vy , 


where 


Pit) = PN(—go — 21) 

pot) = pil — N(—go — %)] 
ps(t) = gl — N(go — 2:)] 
Diltx) = gIN(go — 2%)]. 


In the above, N(x) is the distribution function of the stationary noise 
process, and p and q are the a priort probabilities of a plus one and 
minus one, respectively. In terms of the above elementary probability 
density functions, the error probability on the kth digit may be ex- 
pressed as: 


pt) = | tosGe) + pole] dP), 


We propose a vector extension of Zador’s procedure, namely; an M + N 
dimensional iteration scheme in which each of M + N variables is re- 
placed by a linear transformation on itself during each iteration. To 
elucidate the meaning of Mf + N dimensional iteration and at the same 
time recall some of our earlier vector notation, the first-order itera- 
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tion function U'f is written in summation notation as: 
N AL N M 
U'{(0, 9) = ri > 6,+ do znd (2nn + etn) + a (Pntom an) | 
n=1 m=1 n=l m= 
N M 
yale Cs aa > (nen) | 


n=] 


+> ee Gs +> 


n=1 on 


7 lb, — 61.) + 3 Gem — a) | 


n=1 m=1 


It follows that the probability of error on the kth digit is 


p(k) = U*' [py + pal fo, ¢-0 


where U* is the kth M + N dimensional iterate of Ut. The convergence 
of p(k) in the limit as k — o has not been examined for an M+ N 
dimensional branching process. From Zador’s work on one-dimensional 
branching processes® we may conjecture that absolute system stabil- 
ity (i.e., all poles in left-half plane) implies convergence in the multi- 
dimensional case. 

Although the notation in the foregoing analysis appears formidable 
(quite an understatement) the procedure and its usage are straight- 
forward (at least analytically) for a particular example. At the ex- 
pense of being redundant, we once again point out that even in Simple 
cases, numerical results are hard to come by. 


V. IMPERFECT LOW-FREQUENCY TAIL CANCELLATION 


It is relatively simple at this point to include the effect of imperfect 
low-frequency cancellation in the results of Section IV. As an example, 
such a phenomenon might be caused by a delay of amount 7+ in the 
feedback path. Defining an LZ matrix by 


0 La ly 1 bs 1 ne Li 
0 bis i las re Oe bes 


Pe ae ae 


Li sar Ly ar ls ar Aas Lar 
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where 


Lim = Ian — exp (— Flog, 1) m=1,2,---,M, 


statement (iz) of Section IV may be modified as follows: 
@i) h+tg=e+l1 
1, = Rie; 5 a 
Qe; = Zei-1. 
The effect of this on the recursion relationship for x, is as follows: 
If b, + d,, then 
Lew = (, + 1, — 2hid, + zn +r’vy +1’. 
If b, = d, , then 
Teor = (, + Ld, + zn tr’ve + 27x , 
where w, = (d * L”),-1. 
If d, = 1 ¥ 6, , then with probability p,(a,) 


T. T. T 
Leet = ZU, Fe trv, tatro, + hk. 


If d, = 1 = b, , then with probability p.(z;) 
Teo = ZU fe tr've + rox + hy. 
If d, = 1 ¥ b, , then with probability p3(x,) 


Ur+1 = Zz" Ux — @ + r’y,, —- 4 mee ays a bys 
If d, = —1 = b, , then with probability p,(2,) 
Crt = ZU ae 2 ly r'vVy “f rox et ae 


where p,(%;), Po(Xx), P3(%,), and p4(x;,) are still defined as in Section IV. 

The kth bit probability of error is now evaluated by a 2M + N 
dimensional iteration scheme where the first-order iteration function 
U'f is written as 


N M M 
U'f{(8, @, Y) = p> 6, + PD Om + ye 1) 


i> 20. 7 €1,n) ae bs (Tm Pm ad Gm) So me Pa = hw) | 


m=) 
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N M M 
ot pl > 6, + > Pm + » v0) 
n=1 m=1 m=1 


ff (2nOn + €1.n) + 2 (mm) + 2. (Ym + hn) | 


=e r{ 6, + 2 Pm + > v0) 
A> (En In — €1,n) a 2d (TmPm _ Am) + = Ove i. ho | 
+ pi in + a Om + Sy v0) 


m=1 


N M AL 
AX (2n8n = €1.n) oil p> (mm) ad > (ave 7 hn) | 
It once again follows that the probability of error on the kth digit is 


p(k) > U"lp, fe Ps] lecesvno ’ 


where U* is the kth 21 + N dimensional iterate of U’. 

To reward the reader for his patience up to this point, we will at 
least demonstrate that the general expression for p(k) given above 
reduces to Zador’s result for the single low-frequency pole, perfect 
cancellation case. The assumption of no high-frequency signal shaping 
and perfect cancellation imply that 0, e,, and y, 1, are, respectively, 
zero. Furthermore, a single low-frequency pole results in 7, , ¢,, and 
a, being the only nonzero components of r, @, and a, respectively. Under 
these conditions, 


p(k) = U'lp. + ps) leseo 
where 
Uf _ Dilef(rigr a a) Si P(e.) frie) 
+ pseif(rigr — G) + pili) f(ried 


which is identical to Zador’s result upon combining po(g1) and p4(¢1). 


VI. THE EFFECT OF NOISE CORRELATION 


In this part, the emphasis is placed upon removing the restriction of 
uncorrelated noise while at the same time arranging the results in a 
form which allows easy comparison with the uncorrelated case. The 
approach to be followed is the reformulation of Zador’s work into 
an operational iteration procedure which acts on the joint distribution 
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of the noise samples. The details are presented for the simple binary 
case with perfect feedback cancellation considered by Zador. With 
sufficient patience, extension to the more general situations covered in 
the foregoing sections can be accomplished, but that is not done here. 

To review, the operation of the simplified system may be described 
by the equation 


b, = sen {n, + god, + x}, (1) 
where 
k-1 
x, = 2 Jx- is (2) 
biends 


represents the intersymbol interference accumulated at time t, as a 
result of errors (d; + b;) prior to that time. 
The system output 6, is in error when 


+t <—g and d = 1 (3) 


Ni + ru, > go and dy, = —], 


Since the noise samples are not assumed to be independent, the random 
variables n, and x, are not independent. Hence, the distribution of the 
effective noise n, + x, is not simply the convolution of the distributions 
of n, and x,. Instead, the expression for error probability p(k) = 
prob {b, + d,} must be written as 


p(k) = p / : m(N, , t,) An, dr, 


= @ / i M2(N, , Xz) An, Ax, , (4) 


where mo(nz, 2) 18 the joint density function of n; and 2, and p and q 
are the a priort probabilities of a +1 and —1, respectively. 

A careful examination of the branching process described in Refs. 5 
and 6 for the uncorrelated case shows that a similar process governs 
in the correlated noise case. Define the integral operators p1(x), po(x), 
p3(x) by 


pila) = p a _ 


p(x) = [- =f i seas [ (5) 


P3(x) = af 


Go-z * 
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Note that the action of each of these three operators on a single di- 
mension Gaussian density function results in the three transition prob- 
abilities defined by (13) of Ref. 5. If f(x) is defined analogously (but 
operationally) as 


f(x) = pilx) + p(x) = p if _ Fg I i i (6) 


then the first-order iterative operator Uf(x) is expressed as in Zador, 
namely: 


Uj(x) = pila)fx — a) + pola) f(rx) + ps(a)frx + a) (7) 


with a = —2g,. We note that after separating f(x) into its two com- 
ponents parts, each term of (7) represents a double integration and 
thus (7) has meaning only when applied to a second-order density 
function. Proceeding as in Zador, the error probability on the k+1th 
digit in a random input sequence is expressed as the kth iterate of the 
operator Uf(x) acting on the k+1 dimensional joint density function 
of the noise process V%41 (71, yo, * * * » Yr+1) evaluated at x = 0, i.e., 


p(k) = U*f(a) feeds ye ++ + Ye+1)] [eno.* (8) 


The meaning of iteration for the operators defined here is the same 
as in Zador’s functional case. As an example, we write out p(1) in 
detail: 


—gota 


pl) = Uf(x) von, ¥2)] leno = D a ia vl »¥2) dy dye 
+ pq [- Le vor , 2) Avi dy, + p [ x vo(¥1 , Yo) dy: dye 
taf f atnswandn—o ff mtn swan dn 
— pq [- fe v1 » 2) dy: dy2 — pq [ [- v1 » 2) Av dvs 


—¢ / / VAV1 » Yo) dy1 dys + pg | / V2o(¥1 » Yo) dy: Aye 


se q i i 7 v2o(¥1 V2) dy dye . (9) 


*The convergence of the operational iteration procedure defined by (7) and 
(8) has not yet been proven. Nonetheless, we proceed with our results. 
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The above expression for p(1) can be simplified for a symmetric den- 
sity function v2. It is further emphasized that the arguments of each 
term in the operator as defined by (7) determine the limits on the 
integrals in (9) (i.e., the region of integration). 


VII. CONCLUSIONS 


The analysis presented in this paper might in a broad sense be 
described as vector and operational extensions of the work of Zador. 
In addition to simply considering a vector of low-frequency poles, 
however, the vector approach has enabled us to remove certain other 
restrictions from the basic regenerator problem such as lack of high- 
frequency signal shaping and perfect tail cancellation. Although, the 
question of convergence of the operational iteration scheme for cor- 
related noise samples remains as yet unanswered, the formulation itself, 
is of interest. Little has been suggested for solving the exact computa- 
tional problem. A future paper will discuss some useful approximations 
to cases of relatively low dimensionality. This will generalize results 
given in Ref. 8. 
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Factoring Polynomials 


Over Finite Fields 


By E. R. BERLEKAMP 


(Manuscript received May 9, 1967) 


We present here an algorithm for factoring a given polynomial over 
GF(q) into powers of irreducible polynomials. The method reduces the 
factorization of a polynomial of degree m over GF(q) to the solution of 
about m(q — 1)/q linear equations in as many unknowns over GF(q). 


There are many applications in which one wishes to factor poly- 
nomials. Some programming systems, such as Brown’s ALPAK,;,? deal 
with polynomials and rational functions with integer coefficients. In 
such a context one is interested not in approximate numerical values 
for the real and complex roots, but rather in irreducible factors which 
are themselves polynomials with integer coefficients. One of the stand- 
ard tricks mentioned by Johnson? for finding such irreducible factors 
is to reduce all of the coefficients of the original polynomial modulo 
some prime, p, and then factor the reduced polynomial over the Galois 
Field, GF(p). If the reduced polynomial factors, one gets certain 
constraints on the factors of the original polynomial; if the reduced 
polynomial does not factor over GF (p), then one may conclude that 
the original polynomial is irreducible over the integers. The success 
of this method for factoring polynomials over the integers clearly de- 
pends upon having an efficient procedure for factoring polynomials 
over GF (p). 

The problem of factoring polynomials over finite fields arises di- 
rectly in Golomb’s study* of feedback shift register sequences. In 
Golomb’s words, this study “. .. has found major applications in a wide 
variety of technological situations, including secure, reliable and ef- 
ficient communications, digital ranging and tracking systems, deter- 
ministic simulation of random processes, and computer sequencing and 
timing schemes.” The properties of all cyclic error correcting codes, 
including the important Bose-Chaudhuri!-Hocquenghem® codes, de- 
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pend on the factors of their generator polynomials in some finite field. 

Such codes have been studied extensively by Peterson® and Mac- 

Williams.” Recent advances in decoding techniques by Berlekamp® 

make these codes even more attractive from the practical standpoint. 
We present here an algorithm for factoring a given polynomial, 


f= Dita, fe Gra), 


into powers of irreducible polynomials. 
First, we construct the m X m matrix Q over GF(q), whose ith 
row represents 2*“~” reduced modulo f(z). Specifically, 


m-1 
ZU = De Qesa nase” mod f(z). 
k=0 


The Q matrix may be computed with a shift register wired to multiply 
by z mod f(z). The register is started at 1, which is the first row of Q. 
After q shifts, the register contains the second row of Q; after g more 
shifts, it contains the third row of Q, --- , etc. After g(m — 1) shifts, 
it contains the last row of Q. 

Given any polynomial g(z) of degree <m over GF(q), g(2) = >_"%23 9.2"; 
we may compute the residue of (g(z))* mod f(z) by multiplying the 
row vector [9o, 91; °°* » Jm-1] by the Q matrix. This follows from the 
observation that 


m—1 m—1 m—-1 

(g@))* = g@) = 2 gz = yD 6 ant) 
m—1 m~1 

= a (= Os Qeer.nvs et 
k=0 #=0 


Similarly, we could compute (g(z))* — g(z) mod f(z) by multiplying 
the row vector [90 , 91, *°* » Jm-1) by the matrix (Q — J), where J 
is the m X m identity matrix over GF(q). 

Second, we find a set of row vectors which span the null space of 
(Q — 1). This may be done by appropriate column operations on the 
matrix (Q — J).° Each such row vector in the null space of (Q — J) 
represents a polynomial g(z) which satisfies the equation (g(z))* — 
g(z) = 0 mod f(z), and conversely, each g(z) which satisfies this equa- 
tion is represented by a row vector in the null space of (Q — I). 

Third, we select any of the polynomials g(z) found in the second step, 
and apply Euclid’s algorithm to determine the greatest common 
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divisor of f(z) and g(z) — s for each s ¢ GF(q).* We then have the 
factorization 


1@) = ue (g.c.d. (f@), g@ — 8). 
Remark: If g(z) is a scalar, then this factorization degenerates into 


f(z) = g.c.d. (f(2), 0) IT g.c.d. (b(2), s) 
= f@ JI 1. 


s¥0 


However, if g(z) has positive degree, then the factorization is non- 
trivial. 

Proof: Since (g(z))* — g(z) = 0 mod f(z), f(z) divides (g(z))* — 
g(z) = [cerca (g(2)) — s. Therefore, f(z) also divides 


[]..cF a (g.c.d. (f(@), 9@) — 8)). 


On the other hand, g.c.d. (f(z), g(z) — s) divides f(z). If s ¥ t, and 
s, t e GF(q), then g(z) — s and g(z) — ¢ are relatively prime, as are 
g.c.d. (f(z), g(z) — s), and g.c.d. (f(z), g(z) — t). Therefore, 


L]..¢r:a (g.¢.d. (f(z), g@) — 8)) 


divides f(z). Assuming both polynomials to be monic, they must be 
equal since each divides the other. Q.E.D. 


Example I: Let f(z) be the binary polynomial 1110001110001, or f(z) = 
l+e+t2+2° +27 + 2 + x”. The successive powers of z are 





100000000000 111000111000 
010000000000 011100011100 
001000000000 001110001110 
000100000000 000111000111 
000010000000 111011011011 
000001000000 100101010101 
000000100000 101010010010 
000000010000 010101001001 
000000001000 110010011100 
000000000100 011001001110 
000000000010 001100100111 
000000000001 


* In practice, there is no need to perform all of Euclid’s Algorithm q separate 
times to determine all of the g.c.d.’s. A short cut will be seen in the example. 
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100000000000 000000000000 
001000000000 011000000000 
000010000000 001010000000 
000000100000 000100100000 
000000001000 000010001000 

SO 000000000010 and 000001000010 
Q = 111000111000 Q — I = 111000011000. 
001110001110 001110011110 
111011011011 111011010011 
101010010010 101010010110 
110010011100 110010011110 
001100100111 001100100110 


If we number the columns of Q — IJ from 0 to 11, then the upper 
right quarter of the Q — J matrix may be zeroed if we add the 3rd 
column to the 6th column, the Ist, 2nd, and 4th columns to the 8th 
column, and the 5th column to the 10th column. The lower right 
quarter of the Q — JI matrix then becomes 


011000 
111110 
_ 011001 
~ 010110° 
011110 
001110 


The equation [g5, gz, -*: , guj = O is found to have solutions 
[96597 °° » ul = [A, 0, 0, A, 0, A] where A = 0 or 1. The first six 
coordinants of g are then readily found from the equation g(Q — J) = 0, 
with solutions g = [B, A, 0, A, A, 0, A, 0, 0, A, 0, AJ]; A, B e GF(2). 
Finally, we apply Euclid’s algorithm to f(z) = 1110001110001 and 
g(z) = s10110100101. By letting ¢ = s + 1, and leaving s as an in- 
determinate, we may effectively find the g.c.d. of 111000111001 and 
010110100101 with the same computation that computes the g.c.d. 
of 111000111001 and 110110100101: 


1110001110001 
310110100101 

1 2001110101 
310110100101 
s0 #11101 

1 2001110101 

1 20s1s01 

s0 111101 

ttttOt 


Rk 
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If t = 0, the g.c.d. is 10011101; if s = 0, the g.c.d. of 1110001110001 
and 010110100101 is equal to the g.c.d. of 111101 and 11001001, which 
is 111101. Both 111101 and 10011101 are irreducible and the factoriza- 
tion is complete: 


Itze+2/+24+27424 2") 
=(l+24274242)14+ 24242 +2') over GF(2). 
In general, suppose f(z) = []; (p“?(z))*', where each p“(z) is 
an irreducible polynomial over GF(q). Then f(z) divides 


J].crco (92 =) 
if each (p‘(z))** divides g(z) — s; for some s; e GF(q). On the other 
hand, given any set of scalars s,, s., °-+ , 8, ¢« GF'(q), then the Chinese 
remainder theorem guarantees the existence of a unique g(z) mod f(z) 
such that g(z) = s; mod (p“(z))** for all 7. Since there are g” choices 


of 8; , 8S, ++: ,S,, there are exactly g” solutions of the equation (g(z))* — 
g(z) = 0 mod f(z). Therefore, 


The number of distinct irreducible factors of f(z) is equal to the dimen- 
sion of the null space of (Q — I). 


In particular, the polynomial f(z) is the power of an irreducible 
polynomial iff the null space of (Q — J) has dimension 1. In this case, 
the only solutions of (g(z))* — g(z) = 0 mod f(z) are scalars in GF(q), 
and the null space of Q@ — I contains only vectors of the form 
[s, 0, 0, «++ , O]. If the null space of Q — J has dimension n, it has a 
basis consisting of n monic polynomials: g“(z), g™(z), «++ , g™(2). 
Without loss of generality, we may assume that g™(z) = 1 and that 
the other n — 1 basis polynomials have positive degree. 

When we apply Euclid’s algorithm to f(z) and g® (z) — s, we obtain 
a factorization of f(z). If this gives fewer than n factors of f(z), then 
we may compute the g.c.d. of g®(z) — s and each known factor of 
f(z). By this process, we may continue to refine the factorization of 
f(z). The following argument shows that this process must eventually 
yield all n irreducible-powers which are factors of f(z). 

Let C be the » X mn matrix over GF(q) defined by the equations 
g°?(2) = Ci,; mod (p“(z))**. Then C must be nonsingular. For if 
>; AC; = O for all z, then >); Ajg? (2) = 0 mod (p‘(z))** for 
all 7, whence >); A,g‘?(z) = 0, contradicting the linear independence 
of g™(z), g™(z), «++ , gz). When we apply Euclid’s algorithm to 
f(z) and g‘?(z) — s, we obtain a factorization of f(z) into as many 
different factors as there are distinct elements in the jth row of C. 
The irreducible-powers (p“”(z))** and (p“ (z))** are separated iff C;,; #4 
C,,; . Since C is nonsingular, for every 7 and k there exists some j such 
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that C;,; + C,,;. Thus, any two irreducible-power factors of f(z) 
will be separated by some g‘”’(z). 

The factorization of any power of an irreducible polynomial is 
readily accomplished by applying Euclid’s algorithm to the poly- 
nomial and its derivative. 

We conclude with another example. 


Example II: Following a suggestion of R. L. Graham, we let f(z) = 
2” — lover GF(q), where n and q are relatively prime. Then Q;4; 34: = 1 
if gi =j mod n. Specifically, suppose n = 15 and gq = 2. Then 


100000000000000 ooooococded0000 =O. 
001000000000000 011000000000000 1 
000010000000000 001010000000000 2 
000000100000000 000100100000000 3 
000000001000000 000010001000000 4 
000000000010000 000001000010000 = 5 
000000000000100 000000100000100 =6 
000000000000001 @ — I = 000000010000001 7- 
010000000000000 010000001000000 8 
000100000000000 000100000100000 9 
000001000000000 000001000010000 10 
000000010000000 000000010001000 11 
000000000100000 000000000100100 12 
000000000001000 000000000001010 13 
. 000000000000010 ooceco000Ceddl1 14 


By suitably permuting the rows and columns, we can bring Q — I 
into the form 

ojoo09lo000\o000|00__ 0 
0/1100)0000/0000/00 =i 
0/0110|0000/0000/00 + 2 
0,0011;0000)0000|;00 4 
0[1001|000010000}00__ 8 
0\0000}1100|0000|00 7 
0|0000\0110\0000|00 14 
0/0000/0011/0000\00 13 
olo000|1001\o000l00 11 
O/O000;0000/1100}00 3 
0/0000,0000/0110,/00 +6 
0/0000;0000/0011/00 12 
ojoo00|o000|1001]00__9 


0j}0000)0000/0000/11 5 
0j0000;0000;0000}11 10 
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A basis for the null space of Q — I is seen to be 
gG@°@azt74e2442 
g?@ =e +e + 2% 42% 
GQ? @aF+tHi 42°42 
g?°@ =e +2”. 
In general, if f(z) = 2” — 1 over GF(q), then we may choose 
9@ = Le, 


where C is any set of numbers which is closed under multiplication 
by q mod n. Each such polynomial g(z) has some nontrivial factor in 
common with 2” — 1. 
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The Enumeration of Information 


Symbols in BCH Codes 


By E. R. BERLEKAMP 
(Manuscript received May 9, 1967) 


This paper presents certain formulas for I(q, n, d), the number of 
information symbols in the q-ary Bose-Chaudhuri-Hocquenghem code of 
block length n = gq” — 1 and designed distance d. By appropriate ma- 
nipulations on the m-digit q-ary representation of d, we derive a simple 
linear recurrence for a sequence whose mth term is the number of informa- 
tion symbols in the BCH code. 

In addition to an exact solution of all finite cases, we obtain exact as- 
ymptotic results, as n and d go to infinity while their ratio n/d remains 
fixed. In this limit, the number of information symbols increases as n°. 
Specifically, we show that for fixed u,O Su S 1, 

lim g ™*I(g, q” — 1, ug") = J, 
where s is a singular function of u. The function s(u) is continuous and 
monotonic nonincreasing; it has derivative zero almost everywhere. Yet 
s(0) = 1 and s(1) = 0. For q = 2, s(u) ts plotted in Fr. 1. 


Any cyclic code of block length n over GF(q) may be defined by its 
generator polynomial, g(x), which is some factor of x” — 1 over GF(q), 
or by its check polynomial, h(x) = (2 — 1)/g(x). The number of 
check digits in the code is given by the degree of g(x); the number of 
information digits, by the degree of h(x). We assume that n and q 
are relatively prime. It is most convenient to work in a particular ex- 
tension field of GF(q), namely GF(q"), where m is the multiplicative 
order of q mod n. In this field, 2* — 1 factors into distinct linear factors: 


a’—1= [J @—- a’). 
i=1 


Here a@ is any primitive nth root of unity in GF(q”); a” = 1. From the 
factorization x” — 1 = g(x)h(x), we see that every power of a is a root 
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u (IN BINARY) 


Fig. 1— Graph of s(u) vs. u. 


of either g(x) or of h(x), but not both. Thus, a cyclic code partitions 
the powers of a into two sets: those powers which are roots of the 
generator polynomial, and those powers which are roots of the check 
polynomial. If g(x) and h(x) were permitted to have coefficients in 
GF(q"), then any partition of the powers of a would define a cyclic 
code. However, the coefficients of g(x) and h(x) must lie in the ground 
field GF(q). Consequently, if a* is a root of g(x), then so are the con- 
jugates of a, namely a‘, a’, a”, --+ . Conversely, if all conjugates 
of roots of g(x) are also roots of g(x), and all conjugates of roots of 
h(x) are also roots of h(x), then all of the coefficients of the polynomials 
g(x) and h(x) lie in GF(q). 

The previous remarks hold for all cyclic codes. 

A q-ary BCH code of block length n over GF(q) may be defined as 
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the cyclic code whose generator’s roots include only a, a’, «+: , a” 
and their conjugates. This code is capable of correcting any combina- 
tion of less than d/2 errors; (cf. Berlekamp’) the minimum Hamming 
distance of this code is at least d. For this reason, d is called the designed 
distance of the code. 
The first result on the number of information symbols in BCH codes 
is the following lemma: 


Classical Lemma I: Let I(q, n, d) be the number of information symbols 
in the q-ary BCH code of block length n and designed distance d. 


Define [7] by the equations 
t= [t]modn and 1Sf[r] Sn. 


Then I(q, n, d) is the number of integers j, such that1 < j S$ nand[jq"]| = d 
for all k. 


Proof: a’ is a root of the generator polynomial of the BCH code iff 
there exists some k(j) such that [jq"] < d. Conversely, a’ is a root 
of the check polynomial iff [jq"] = d for all k. Q.E.D. 


The classical lemma enables one to compute the number of informa- 
tion symbols in any q-ary given BCH code without doing any cal- 
culations in GF(q) or its extensions. One need only enumerate certain 
types of residue classes mod n. In practice, this enumeration is still 
often tedious, particularly when n and d are large. 

In order to obtain more tractable results for large n and d, we prefer 
to start from an alternate form of the classical lemma: 


Classical Lemma II: Let I(q, n, d) be the number of information symbols 
in the qg-ary BCH code of block length n and designed distance d. 


Define |_2| by the equations 


t= [t|modn and O0S|[t]Sn-1. 


IA 


Then, I(q, n, d) 1s the number of integers 2, such that 0 tin-1 


and | iq’ |<n-+1-—d forall k. 
Proof: 15 j S nand[jq*] = d forall kifO0 S m—j) S$ n—-1 


and | (n — j)q*| S$ n — d forall k. Let i = n — j. Q.E.D. 


In the wide sense, BCH codes may be defined over any alphabet 
whose order, g, is a prime power, and for any block length, n, which 
is relatively prime to q. In the narrow sense, however, 7 is required to 
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be one less than a power of g. For narrow sense codes, the smallest 
extension field of G/'(q) which contains the nth roots of unity is GF(n+1). 
For wide sense codes, this extension field is always larger, usually 
much larger. Since the decoder must perform certain computations 
in this extension field, narrow sense BCH codes are more easily im- 
plemented than their more general wide sense counterparts. 

We shall enumerate the information symbols in narrow sense BCH 
codes by reducing the problem to the enumeration of certain kinds of 
sequences over the alphabet consisting of the integers 0, 1, 2, --- ,q— 1, 
as first suggested by Mann.” We begin by defining the appropriate 
manipulations with such sequences. 

We shall always use capital letters for sequences. We let (Q — 1) 
denote the sequence consisting of the single letter gq — 1. Unless other- 
wise stated, we allow every sequence to be either finite or infinite. 

Let V = V,V2V3 --- be any finite or infinite qg-ary sequence (..e., 
a sequence of numbers V;, where V; is an integer, 0 S$ V; S$ q — 1. 
We let V = V,V2V;3 --- denote the complement of V, defined by V; = 
(q- 1) — V; for allz. If W = WiW,. --- W, is a finite q-ary sequence, 
then we may form the cyclic shifts of W: WoaWz --- W,W1, WW, 
--» W,W,W., --- . If X is a finite qg-ary sequence, X = X,X_2 :-- 
X;, then we may form the concatenation X * V = X,X.X3 °°: 
X;ViV2V3 +++ . This concatenation may be formed whenever V is 
a finite or infinite g-ary sequence. If V is a finite qg-ary sequence, then 
V * X is a cyclic shift of X * V. 

The qg-ary sequence Y is said to be a prefix of X iff X = Y * Z for 
some Z; Y is said to be a suffix of X iff X = Z* Y for some Z. A prefix 
must be a finite (or empty) sequence; a suffix may be empty, finite, 
or infinite. V is a proper prefix of X iff X = V * Z, and neither V nor 
Z is empty. Z is a proper suffix of X iff X = V * Z and neither V nor 
Z is empty. If X is a finite g-ary sequence, XY = X,X,--- X,, then we 
may form the iterated concatenation of X with itself, X = X,X, -:- 
X,X,X_ +++ X, +++ . In particular, (Q — 1) denotes the infinite q-ary 
sequence all of whose letters are q — 1. 

Wesay X < Y iff there exists aj such that X; = Y; for? = 1,2,---, 
j — 1, but X; < Y;. If X « Y and Y < X, then one is a prefix of 
the other. 

This ordering is similar to the numerical ordering of g-ary fractions, 
but there are important differences. For example, | = 0.01 < 0.0101 = 
5/16, but the sequences 01 and 0101 are incomparable, because one is 
a prefix of the other. On the other hand, 0.0111111 --- = 0.1 = 3, 
yet 01111 --- < 1. This type of example may be excluded by writing 
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all fractions in their terminating form if they have one. We may then 
assert the following: 


Let 
UuU= Ss Ug *, v= > Va U = U,G_U; <8 


and V = V,V.V3--- , and suppose that (Q — 1) ts not a suffix of U or V. 
Then 
U<V>u<p 


\y <V 
UuUsvues or 
le is a prefix of V. 


We say that X is an immediate subordinate of Y iff X is a finite se- 
quence, X = X,X.X3--- X,, and X, = Y,, X22 = Y2,°°: , Xe = 
Y,-1, but X, < Y,. The sequence Y has Y, immediate subordinates 
of length 1, Y. immediate subordinates of length 2, Y; immediate 
subordinates of length 3, --- Y, immediate subordinates of length k. 
If the sequence Y has only a finite number of nonzeros, then we may 
define the greatest immediate subordinate of Y. If the last nonzero in 
the sequence Y = Y,Y,. --- is Y,, then the greatest immediate sub- 
ordinate of Y is Y,Y, --- Y,_,(Y, — 1). If the sequence Y contains 
an infinite number of nonzeros, then Y has infinitely many immediate 
subordinates. All of them are less than Y itself, but none of them is 
the greatest immediate subordinate. 

Similarly, we say that Y is an immediate superior of X iff Y = Y,Y.Y3 
--> Y,, where Y; = X,, Yo = X2,°°* , Yu-r = Xz-1 but Y, > X,. 
If X = X,X, --- X, and X, # (Q — 1), then the least immediate 
supertor of Xis Y = Y,Y.--- Y,; Y; = X; for? = 1, 2,---,k — 1, 
and Y, = X, + 1. It should be evident that the least immediate superior 
is among the longest immediate superiors, and the greatest immediate 
subordinate is among the longest immediate subordinates. 


Definition: Tf q is any integer, U is any infinite g-ary sequence and 
m is any integer, we define J(q, U, m) to be the number of g-ary m- 
tuples all of whose cyclic shifts are less than U. 


Lemma III: (Complemented form of Mann’s Lemma) 
Tj 
n=g"—-1, nt+t1l—d= Uiq"™, OSU: <4, 
+=1 
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U = U,U,-:- U,, and Y ts any q-ary sequence then 
I(q,n, a) = J(q, U * Y, m). 


Proof: Lemma III reduces to Lemma II under the following cor- 
respondence: The g-ary m-tuple U corresponds to the integer n + 
1 — d; another q-ary m-tuple W = W,W, --- W,, corresponds to the 
integer w = >)”, Wig” ’. The first cyclic shift of W is the sequence 
W.W3 --- W,,W,, which then corresponds to the integer 


m—1 


2d Ver +W,= qu — (qe = WwW, ‘ 


Modulo n = gq” — 1, the integer corresponding to the first cyclic shift 
of W is seen to be congruent to qw. Therefore, the successive cyclic 
shifts of an m-digit g-ary sequence W correspond to the integers [_w |, 


| wq |, wa’ |, +++, Lwq”"* |. These integers are all <n + 1 — d iff all 
cyclic shifts of W are <U, which is true iff all cyclic shifts of W are 
<U * Y, for any Y. Q.E.D. 


The choice Y = U has an interesting interpretation: 
po Ug’ = pm y oF alate = (> ue la _ a7) 


i=1 i=1 
(d=), 
n 


ll 


> ua aiyer= 


Thus, the sequence U is the g-ary expansion of 1 — (d — 1)/n. For 
this reason, we may investigate the behavior of I(q, n, d) for large n 
and d with a fixed fractional error correction capability, (d — 1)/2n, 
by studying J(g, U, m) as a function of m for fixed g and U. 

We shall temporarily ignore the periodicity of the U sequence, 
and consider the function J(g, V, m) for an arbitrary q-ary sequence 
V. We assume only that the sequence V has no terminal zeros. 

From the definition of the immediate subordinates of V, it is clear 
that 7f an m-digit qg-ary sequence W ts less than V, then some immediate 
subordinate of V is a prefix of W. For if W is less than V, then there 
exists a k such that W; = V; forz = 1, 2,---,k—1, butW, < V,, 
and the sequence W,W, --- W, is a prefix of W and an immediate 
subordinate of V. 

Now suppose that some m-digit sequence W and all of its cyclic 
shifts are less than V. Since W itself is less than V, some prefix of W 
must be an immediate subordinate of V. Are all possible immediate 
subordinates of V possible prefixes of W? In general, they are not, for 
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some immediate subordinates may have suffixes which are greater 
than V. If X * Y is an immediate subordinate of V and Y is greater 
than V, then X * Y cannot be a prefix of W. For, if W = X * Y * Z, 
then one of the cyclic shifts of W is Y * Z * X which is greater than V. 

For example, consider the ternary sequence V = 20212. Its im- 
mediate subordinates are 0, 1, 200, 201, 2020, 20210, and 20211. The 
immediate subordinate 20210 has the suffix 210 which is greater than 
V. Therefore, if 20210 is the prefix of W, then the second cyclic left 
shift of W is greater than V. Similarly, V’s immediate subordinate 
20211 has the suffix 211, which is also greater than V. 

For some sequences V, this difficulty does not arise. If V exceeds 
all of its own proper suffixes, then we have the following theorem: 


Theorem 1: Let V be a q-ary sequence which exceeds all of its own 
proper suffixes. Then: 

(1) No immediate subordinate of V ts a proper prefix of any other 
immediate subordinate of V. 

(it) Every suffix of every immediate subordinate of V is a concatena- 
tion of other immediate subordinates of V. 

(iit) If W and all of its cyclic shifts are less than V, then W can be 
uniquely decomposed into a concatenation of immediate subordinates of V, 
including a (possibly empty) end-around immediate subordinate. Spe- 
cifically Ww = w * Ww Keee & Wwe * wer * wee Kees & wi) * 
W?;,W™, WO ,--»,W are immediate subordinates of V; W‘” * 
WY? * WO « ... * W™ ds the end-around immediate subordinate. The 
end-around immediate subordinate has a prefix, W‘”, which is a suffix 
of W, and a suffix W * W * ... * W™ which is a prefix of W, as 
well as a concatenation of the shorter immediate superiors W™, W™ , 
oe WO 

(iv) Every concatenation of immediate subordinates of V, including 
a (possibly empty) end-around immediate subordinate yields a sequence 
which has the property that all of ats cyclic shifts are less than V. No 
such sequence of length m can exceed the maximum m-digit concatenation 
of immediate subordinates of V. If Y is the maximum m-digit concatena- 
tion of immediate subordinates of V, and Y S U S V, then J(q, V,m) = 
J(q, U, m). 


(v) 
m-1 
J(q, V,m) = mV, + Ds Vid(q, V,m — k), 
k=l 


where V,; is taken as 0 tf 7 exceeds the length of the sequence V. 
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(vt) Let 
n=q"—1, d=ZD.q"", OSD;<4, 
D = D,D, +++ Dn. 
If 
Ve Qo) <p = 
least m-digit concatenation of immediate superiors of V, 


then, 
I(q,n, dad) = J(q, V, m). 


Proofs: 


(4) This property of immediate subordinates does not even depend 
on the suffix condition on V. From the definition of immediate sub- 
ordinates, each immediate subordinate must disagree with V only 
in the immediate subordinate’s last digit, and hence no immediate 
subordinate can be a prefix of any other. 

(it) Let us first prove the weaker assertion: 

(a) Every proper suffix of every immediate subordinate of V has 
a prefix which is a shorter immediate subordinate of V. 

Let S be an immediate subordinate of V, and let S® be a suffix 
of S. We may write S = S® * §™. Since S differs from V only in its 
last digit, S“ is a prefix of V, and V = S” * V™. Since S < JV, 
S® < V™. Since V exceeds all of its own proper suffixes, V° < V. 
Therefore, S°’ < V. Therefore, some prefix of S® is an immediate 
subordinate of V. 

(b) If every suffix of an immediate subordinate has a prefix which 
is an immediate subordinate, then every suffix of every immediate 
subordinate is a concatenation of immediate subordinates. 

For, suppose F is a suffix on an immediate subordinate, then F = 
B” « F® where B™ is an immediate subordinate. Since /' is a suffix 
of F, it is also a suffix of an immediate subordinate, and F® = B?* Fr, 
where B™ is an immediate subordinate --- F = B™ * BM * BS *... 

(iii) Since W < V, it contains a prefix W™ which is an immediate 
subordinate of V. After shifting this prefix around to the end, we may 
similarly identify VW, W®, --- , W‘-”, each of which is an im- 
mediate subordinate of V. The sequence W'S” * W * WS # ... * 
W-» is a cyclic shift of W, and so it must have a prefix, P, which 
is an immediate subordinate of V. P is not a prefix of W‘”, so W‘” 
must be a prefix of P. Suppose that W“ * W™ * --- * W™ is a prefix 
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of P, but that W? * W™ * .-- * W°*” is not a prefix of P. (This 
defines 7.) Then P = WS? * W™ * ... * W™ * S, where the (possibly 
empty) sequence S is a proper prefix of W“*”. Since S is a suffix of 
P, which is an immediate subordinate of V, S is itself a concatenation 
of immediate subordinates of V. But no immediate subordinate of V 
is a proper prefix of any other immediate subordinate of V, so S must 
be empty. 

(wv) This is the converse of (277). Suppose we are given the sequence 
W= so * w™ * Ww eee wih * pe. where Ww, Ww, ees 
W'-? andW® = P™ * §® are immediate subordinates of V. We must 
show that all cyclic shifts of W are less than V. Any cyclic shift is of the 
form C os Ss * wer * Wt?) xk eee & Ww? * we * Ww? Be ok yak 
wt? « P™ where WW = P™ * S®, If S™ is empty, C has the 
prefix W“*”, which is an immediate subordinate of V. If S“ is not 
empty, by (zz) it has a prefix which is an immediate subordinate of V, 
which is a prefix of C. In either case, C has a prefix which is an im- 
mediate subordinate of V. Therefore, C < V. 

(v) V has V,, immediate subordinates of length m, each of which 
has m distinct cyclic shifts. Thus, W may be chosen as a single end- 
around immediate subordinate of V in mV, ways. 

If W is a concatenation of several immediate subordinates of V, 
W= Ww * Ww * eee & yo * ws? where w™, oe vee, ye 
are immediate subordinates of V and W‘” is a (possibly empty) proper 
prefix of the immediate subordinate W‘” * W™ * .-. * W™, then the 
length of W is the length of W‘’” plus the length of VW“ * W™ * ++ * 
W-? « W, For each k, there are V, choices of W‘'~” of length k, 
and J(q, V, m — k) choices for W * W * .-. *# WO? « W™, 

(vt) Least special case: Suppose D is the least m-digit g-ary sequence 
greater than V * (Q — 1). Letting 


@= sD", eS SVG i= DV iq", 
it is evident that d + 0 =n-+1landv = n+ 1 — d. According to 
Lemma III, [(q, n, d) = J(q, V, m). 

(vt) Greatest special case: Let D be the least m-digit concatenation 
of immediate superiors of V. Complementing, D is the greatest m-digit 
concatenation of immediate subordinates of V. In the notation of 
part (2), D = Y. Letting d = =D,q""‘,d = n — d. Lettingn + 1 — 
d = LU,q"*, U > Y because n + 1 — d > n — d. Theorem follows 
from part (7v) and Lemma III. 

(vt) The general case follows because J(qg, U, m) is a monotonic 
function of U. Q.E.D. 


1870 THE BELL SYSTEM TECHNICAL JOURNAL, OCTOBER 1967 


Example I: Let V be the binary sequence 1101. We compute 


Bose distancet Designed distance 

m J(q, V,m) Binary Decimal Binary Decimal 
1 1 1 1 1 1 
2 3 01 1 01 1 
3 4 011 3 010 2 
4 11 0011 at 0011f 3 
5 16 00111 7 00110 6 
6 30 001101 13 001100 12 
7 50 0011011 27 0011000 24 
8 91 00110011 51 00110000 48 
9 157 001100111 103 001100000 96 
10 278 0011001101 205 0011000000 192 
11 485 00110011011 411 00110000000 384 


12 854 001100110011 819 001100000000 768 


Here J(q, V, m) is computed by Theorem lv. The designed distances 
are computed according to Theorem lvi, using V = 0010, with im- 
mediate superiors 1, 01, and 0011. V * (Q + 1) = OOIOLIII1I111 --- . 

Evidently, the binary BCH code of block length 2°” — 1 and designed 
distance 768 is identical to the binary BCH code of block length 2” — 1 
and designed distance 769 or 770 or --- or 819. This code has 854 
information symbols. This code is distinct from the binary BCH code 
of block length 27 — 1 and designed distance 820. This is true in 
general, because the least m-digit concatenation of immediate superiors 
of V is necessarily minimum among all of its own cyclic shifts. This 
“greatest designed distance” is called the Bose distance. 

It happens that the binary BCH code of block length 2° — 1 and 
designed distance 768 is also distinct from the binary BCH code of 
block length 2'* — 1 and designed distance 767, because the 12-digit 
binary expansion of 767 is minimum among all of its cyclic shifts. 
This, however, need not be true in general. For example, the binary 
BCH code of block length 2* — 1 and designed distance 3 is not distinct 
from the binary BCH code of block length 2* — 1 and designed distance 
2, because the 4-digit binary expansion of 2 = 0010 is not minimum 
among its cyclic shifts; the minimum is 0001. 

+ Defined later in the text. 


£ This code is identical to the binary BCH code of block length 15 and de- 
signed distance 2. 
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In general, we would like to determine the number of information 
digits in the g-ary BCH code of block length n = g” — 1 and designed 
distance d = ZD,q”" *. The previous theorem gives us a solution to 
this problem if we can find a sequence V which is greater than all of 
its own suffixes and has the property that 

VeQ-1)<Ds 

least m-digit concatenation of immediate superiors of V. 
Complementing this condition gives 
V>D2 


greatest m-digit concatenation of immediate subordinates of V. 
or 


V>D*@Q-+1)> 
( greatest m-digit en 
of immediate subordinates of V/ * 0 > X, 


where X is the greatest immediate subordinate of V. We may assume 
that V has no terminal zeros, and that the length of V does not exceed 
the length of D. Since X and V have the same length, X is a prefix of D. 

Since V is the least immediate superior of X, the problem of finding 
V is reduced to the problem of finding X, which is a prefix of D. The 
solution is as follows: 


Theorem 2: Let X be the shortest prefix of D such that 
D=X*PF, Fe@-)zdD+*@Q-), 
and let V be the least immediate superior of X. Then 
() V*Q-1)<Ds 
least m-digit concatenation of immediate superiors of V. 
(21) V exceeds all of its own proper suffixes. 
Proof of (2): 


Since X is a prefix of D and V is an immediate superior of X, V 
is an immediate superior of D.So V > D and V > D* (Q — 1). 
Complementing gives V * (Q — 1) < D¥0,s0o V* (Q + 1) < D. 
<—- to-— 
Let X“ = X* X*... * X. Then F * (Q + 1) = D* (Q =~ 1) is 
equivalent to X° * F * (Q — 1) => X¥% * F * (Q — 1). Therefore, 
X*X%*«e Fe (Q21) 2 X*# XV «KF *(Q— lorX%*+F*(Q-1)2 
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X® * F * (Q — 1). By induction, X° *F*(Q +1) =X" * F* 
QQ -— lj andD*(Q-1)2 X“ * Fh #* (Q — 1) for all k. Since this is 
true for arbitrarily large k, D * (Q — 1) 2 X. Complementing, D* 0 S 


X S any infinite concatenation of immediate superiors of V. There- 
fore, D S any m-digit concatenation of immediate superiors of V. 
Proof of (iz): 
Let X = Y * Z * L, where Y and Z are arbitrary (possibly empty) 
and L is the final digit of X. We have 
V=YrZ*e(L+1) 

D=Y*Z*L*F 
F*Q-)2YV*t*Z*LeF*(Q-—1)=X*F*(Q— 1); 
X*FtQ+1)=YV*#Z* Le F*(Q—1)>2Z2*L* F* (Q — 1), 
else Y would be a shorter prefix than X which satisfied the same condi- 

tions. 

No proper suffix of V can equal V, for the suffix must be shorter. 

If some proper prefix of V, say Z * (LZ + 1), (Z possibly empty) 
exceeds V, then 

Z*(Lt)>V#tZ4*+#L+)>VtZ*LaX. 

IfZ*L> X, then Z* L* F* (Q—1) > X* F* (Q — 1), a con- 

tradiction. If Z * L is a prefix of X, then X = Z * L * G and from 
X*FeQ-])>Z*LeF*(Q— 1) 
we have 
Z*L*@G*eF*¥Q-1)>Z*L*F *Q—)) 
G*F*e#Q-)D>F#Q-)Y)AX*F*(Q— 1). 

Now Z * L is a shorter prefix than X, a contradiction. Therefore, 
Z*(LZ+1) < Y*Z* (L+1),i¢, V exceeds all of its own proper 
suffixes. Q.E.D. 
Example II: Let g = 9,n = 728 = 9° — 1,d = 217. Then D = 261, 
D = 627, D* 8 = 627888 --- , X = 62, V = 63, V = 25. 


Bose distance 


m mi q-ary decimal 
1 6 3 3 
2 42 26 24 
3 270 263 219 





{In this and subsequent tables we use the single later J as an abbreviation 
for J(q, V, m). 
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Example III: Let q = 2, = 511, d = 185. Then D = 010111001, 
D = 101000110, D * 1 = 101000110111111 --- , X = 101000, V = 
101001, V = 010110. Immediate superiors are 010111, 011, 1. 


m J Bose distance Smaller designed distance 
1 1 1 1 

2 1 11 11 

3 4 O11 011 

4 6 Olll 0111 

5 6 O1111 01111 

6 16 010111 010111 

7 22 0101111 0101110 

8 29 01011111 01011100 

9 49 010111011 010111000 


Example IV: Let gq = 2,n = 511 = 2 — 1,d = 187. Then D = 
010111011, D = 101000100 xX, V = 101000101, V = 010111010. 


m J Bose distance 
1 1 1 

2 1 ii 

38 4 O11 

4 5 O111 

5 6 O1I11 

6 10 011011 

7 22 0101111 

8 29 01011111 

9 49 010111011 


The answer agrees with Example III, although the recurrence is 
different. This illustrates the general nonuniqueness of V. Theorem 
2 specifies one satisfactory method of finding V, but as seen from 
this example, this V need not be unique. The simplest recurrence 
rule generally arises from the shortest possible V, which corresponds 
to the greatest V, or the least D. This can generally be found by first 
reducing D insofar as permissible. 


Example V: Let n = 2° — 1, d = 411. Then D = 00110011011. 
We could take D = 11001100100 = X and proceed. However, we 
instead consider d = 410, D = 00110011010. Since D has a cyclic 
shift smaller than itself, the code is unchanged. But D = 11001100101, 
X = 1100110010 does not look much easier, so we continue. Each 
prime marks the starting point of a smaller cyclic shift. 
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D 


00110011010 
00110011’001 
00110011’000 
00110010111 
00110010110 


0011’0010000 
0011’0001111 


0011’0000000 
00101111111 


Since 00101111111 has no cyclic shift less than itself, this designed 
distance is the Bose distance of a different BCH code. We must instead 
use D = 00110000000, D = 11001111111, X = 1100, V = 1101. The 
recurrence is given in Example I: J(2, 2** — 1,411) = 485. This 
same V is obtained if we started with D = 00110011000, or any D 
in the region 


00110000000 < D ¥S 00110011000 


Example VI: Let g = 2,n = 2° — 1, D = 001010010100111, D* i = 
1101011010110001111---, X=110101101011000, V =110101101011001, 


V = 001010010100110 


m Bose distance 
1 1 1 
2 3 01 
3 4. O11 
4. 11 0011 
5 16 00111 
6 36 001011 
d 64 0010101 
8 115 00101011 
9 211 001010011 


10 378 0010100111 
11 694 00101001011 

* 42 1256 001010010101 
13 2276 0010100101011 
14 4112  00101001010111 
15 7474  001010010100111 
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Although the brute force method just used gives the right answer, a 
more devious approach proves easier. Instead of D = 001010010100111, 
let us consider D = 001010010100101, X = 11010, V = 11011, V = 
00100. This yields a different set of codes, with a much simpler re- 
currence: 


m J Bose distance 
1 1 ] 
2 3 01 
3 4 011 
4 11 0011 
5 21 00101 
6 36 001011 
7 64 0010101 
8 115 00101011 
9 211 001010011 


10 383 0010100101 

11 694 00101001011 

12 1256 001010010101 

13 2276 0010100101011 

14 4126 00101001010011 

15 7479 001010010100101 
The code with D = 001010010100111 has 5 less information digits 
than the code with D = 001010010100101, corresponding to the 5 
distinct cyclic shifts of 001010010100101. 


I. ASYMPTOTIC RESULTS 


Let us define the enumerator 
J(q, U;2 = Do J(q, U, me”. 
m=1 


Given a sequence V which is less than all of its own proper suffixes, 
we may also define 


V® = Dive’, 
k 
so that 
zV'(e) = dO kV iz". 
k 
The recurrence 


m—1 
J(q, ¥, m) = MV m = > VJ (q, Vv; m — k) 
k=1 
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becomes 
J(q, V;z) = 2V’@) + V@J(q, V;2) 


whose solution is 


7 eee A 
Let pi, p2, °*: be the (not necessarily distinct) complex reciprocal 


roots of 1 — V(z). Then 
1-V@ = IIa - o2) 
-V@= 2s Pp; IT] (1 — p72) 


iro. z > pit a pa ps (p2)" = > > pz”. 


7 1 — pz 





Therefore, 


J(q, V32) = p> » ee” 


SO 
J(q, V, m) = x pi 
where p; are the complex numbers defined by the equation 
1— V@ = ITA — ee). 


Although this gives an explicit expression for J(g, V, m), the expression 
depends upon the complex numbers p;. For finite values of m, it is 
usually easier to compute J(q, V, m) directly from the recurrence 
relation of the previous section, since these calculations involve only 
integers. For asymptotic results, however, the above equation is very 
useful. 


Definition: Let p = max |p; |, Jet s = log, p. 


Since all coefficients of the polynomial V(z) must be nonnegative 
integers not exceeding q — 1, it is easily seen that the p,; with the 
maximum absolute value is real and positive, and 1 S p S gq. Clearly, 


J(q, V, m) & p”™ 
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for large m, in the sense that 


lim p "J(q, V, m) = 1. 


MO 


Similarly, 
log, J(q, V, m) } m log, p = ms. 
If 
u= > Uiq", 
t=1 
and 


X<UsV*0O, 


where V exceeds all of its own suffixes and X is the maximum sub- 
ordinate of V, then 


ese = 1,0) 


In other words, if we fix the fraction d/n = u and let n and d grow 
large, then 


8 


Ilyn 
or, more precisely, 


s(u) = lim eM gS fae) 


For given q, the function s(u) is a rather complicated animal. To 
compute it, one must first write u in q-ary. If U exceeds all of its proper 
suffixes, set V = U; otherwise write U = X * F where X is the shortest 
prefix such that U < F. V is then taken to be the least immediate 
superior of X. Then, s is defined as the logarithm (base g) of the maxi- 
mum reciprocal root of 1 — V(z). 

It may easily be shown that s is a continuous, monotonic nonincreas- 
ing function of wu. It may also be shown that the derivative of s with 
respect to u is either 0 or it is undefined. There are two kinds of points 
at which the derivative s’(u) is undefined. First, there are the end- 
points of the intervals on which s(w) is constant. u is a lower endpoint 
of such an interval iff U is a finite sequence which exceeds all of its 
own proper suffixes; u is an upper endpoint of such an interval iff U 
is a periodic sequence, equal to some of its suffixes but not less than 
any others. At these endpoints, s(w) is undifferentiable because it has 
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only a right derivative or a left derivative, but not both. There is 
only a countable number of points of this type. 

The more interesting points are those at which s(w) has neither a 
right derivative nor a left derivative. This happens iff U is an infinite 
sequence which exceeds all of its own proper suffixes, and 0 is not a 
suffix of U. 

The set of points u such that s(u) is not differentiable is uncountable, 
but it has measure 0. Professor T. Pitcher of the University of Southern 
California has also shown’ that this set has Hausdorf dimension 1. 
This appears to be entirely due to the large density of these points 
in the vicinity of u = 0. In general, I conjecture that the Hausdorf 
dimension of the set of points in the interval a S wu S b [where 0 S a, 
b S 1, s(a) ¥ s(6)] is s(a). In some sense, almost all of the nondif- 
ferentiable points in any interval seem to lie very near the leftmost 
cluster point of the interval. 

When Mann’ first obtained results identical to those here in the 
special cases uw = gq“, he also showed that p is the only reciprocal 
root of 1 — V(z) with magnitude greater than 1. Thus, not only is 


Ip", 
but in fact, for sufficiently large m, 
I = (p"), 


where (-) denotes the nearest integer to ‘“-’. Unfortunately, this 
strengthened result is not true in general. For some values of u, 1 — V(z) 
has only one reciprocal root with magnitude greater than 1, but for 
other values of u, 1 — V(z) has many reciprocal roots with magnitude 
greater than 1. Little is known about the behavior of the smaller 
complex reciprocal roots of 1 — V(z) as a function of u, although 
B. F. Logan* has obtained a few preliminary results in this area. 


II. ACTUAL DISTANCE 


As one increases the designed distance, the number of information 
symbols in the resulting code must either remain constant or decrease. 
Thus, 


I(q, n, d) is the maximum number of information symbols in any of the 
g-ary BCH codes with designed distance = d. 


We must be careful to distinguish between I and f, defined by 
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I(q, n, d) is the maximum number of information symbols in any of the 
q-ary BCH codes with actual distance = d. 


It is obvious that L(g, n, d) = I(q, n, a). 

For example, there are three binary BCH codes of block length 23, 
having 23, 12, and 1 information symbols. The code with 23 informa- 
tion digits has Bose distance = actual distance = 1, but the code 
with 12 information digits has Bose distance 5, actual distance 7. 
The code with 1 information digit has Bose distance = actual distance = 
23. Therefore, I(2, 23, 6) = I(2, 23,7) = 1, but £(2, 23, 6) = 1(2, 23,7) = 
12. For all values of d ¥ 6 or 7, I(2, 23, d) = 1(2, 23, d). 

The known cases in which L(g, n, d) > I(q, n, d) are relatively sparse. 
Peterson, Kasami, and Lin’ and Berlekamp® have investigated this 
question for narrow sense binary BCH codes (where gq = 2 and n = 
2” — 1). They proved that f(2, 2" — 1, d) = I(2, 2” — 1, d) ifd divides 
2” — 1, or if d is one less than a power of 2, or if m’ divides m and 
f(2, 2" — 1, d) = 1(2, 2” — 1, d) > 12,2” —1,d4+ 1), orif mis 
sufficiently small, or if d is sufficiently small, or if d and/or m satisfy 
any of various other number theoretical constraints. More recently, 
Peterson and Lin’ have shown that if [(2, 2” — 1,d) = I(2, 2” —1,d) > 
I(2,2"—-1,d+1),and1 $j <m-— dthen/f(2, 2” — 1, 2’d + 27 — 1) 
= [(2, 2” — 1, 2'd + 2’ — 1). No examples are known in which [(2, 
2” — 1, d) > I(2, 2” — 1, d), and it has been conjectured that f(2, 
2” — 1,d) = I(2, 2” — 1, d) for all m and d. ; 

Although this conjecture remains open, we can obtain certain results 
about the asymptotic behavior of J(2, 2” — 1, u2”) from the known 
classes of special cases in which [(2, 2” — 1, d) = [(2, 2” — 1, d). 
We would like to define 


? wae . 
Gu) Tis log. fF 1, u2") | 


Unfortunately, however, we have no assurance that the limit exists. 
In order to discuss the asymptotic behavior of the best BCH codes, 
we define ; 


su) = lim sup logs £2, 2" — 1, u2") a — 1, u2") 


Clearly 8(u) = s(w). Like s(u), §(u) must be a monotonic nonincreasing 
function of u, because if d’ > d, the codewords of the g-ary BCH code 
of distance d’ are a subset of the codewords of the g-ary BCH code of 
distance d. 
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We can prove that §(u) = s(u) for certain values of u, as indicated 
by the following theorem: 

If u = 2, then &(u) < s(27") 
Proof: We know that if u = 2™*, then 


12:20" 42") 2 10.2" = 1,2" =) 


= [(2,27 —1,2"* —1) mek 
Hence, 
log [(2, 2" — 1, u2”) fo lop 1, 21 2 el) 

m = m 

So 
m m—k xs 
4(u) < lim log [(2, 2 1,.2 1) = 32°) 

™m-+00 . m 

because s(wz) is continuous. Q.E.D. 


. This shows that §(u) = s(w) if wu = $, 4, $, --+ . Similarly, one can 
show from the recent theorem of Peterson and Lin that §(u) = s(u) 
for certain other values.of wu. 

We conjecture that §(u) = s(u) for all uv. This is a weakened form 
of Peterson’s conjecture that 7(2, 2” — 1, d) = I(2, 2" — 1, d) for 
all m and d. 
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I.quations Governing the Electrical 
Behavior of an Arbitrary Piezoelectric 
Resonator Having N Electrodes* 


By P. LLOYD 
(Manuscript received May 24, 1967) 


In a paper by J. A. Lewis (B.S.T.J., 40, 1961, pp. 1259-1280) general 
formulas for the electrical admittance of a piezoelectric resonator, having 
essentially one pair of electrodes, were derived in terms of motional param- 
elers associated with the normal modes of vibration of the device. The logical 
extension of this work to a resonator with N electrodes 1s presented here. 
Expressions are given for both the admittance and impedance matrices 
of the resonator. These matrices are expressed in terms of motional param- 
eters associated with, respectively, (7) the normal modes of vibration with 
all electrodes connected together, and (it) all electrodes left open circuited. 
The electrical equivalent circuit for the 2-port characteristics of the N 
electrode resonator ts given for two particular examples. 


I. INTRODUCTION 


General formulas for the electrical admittance of a piezoelectric 
resonator having essentially one pair of electrodes were derived by 
Lewis.1 These formulas are consistent with those derived earlier for 
special cases such as long bars and large plates (see, for example, 
Mason?). In Lewis’ work the admittance function is expanded about 
its poles in an infinite series. The residue at one of these poles deter- 
mines the strength of the contribution of the normal mode, associated 
with the pole, to the overall vibrational behavior of the resonator 
when it is driven at a frequency close to the natural frequency of the 
mode. Surprisingly, the work of Lewis seems to have seen little ap- 
plication, as far as can be judged, except for that of Lloyd and Red- 
wood,’ and Byrne, et al.* 


* Most of the work described here is based on part of the author’s doctoral 
thesis (University of London, 1966). 
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With the current interest in multi-electroded resonators, such as the 
monolithic crystal filter,® it is pertinent to consider the logical exten- 
sion of the work of Lewis to the case of an arbitrary resonator having 
N electrodes. A discussion of this problem has previously been pre- 
sented by the author,® and also by EerNisse and Holland.’ 

Included in Section II of this paper are the basic equations govern- 
ing the piezoelectric resonator, presented here for completeness. 

In Section III various integral relations are derived for use in Sec- 
tion IV where the properties of the admittance and impedance matrices 
are investigated. The electrical equivalent circuits for two particular 
2-port configurations of the N electrode resonator are also derived in 
Section IV, in order to illustrate the application of the admittance 
and impedance matrices. 

A brief list of the principal symbols used in the text is given below. 


1.1 List of Symbols 


p Mass per unit volume. 
-p, Mass per unit area of an electrode. 
u; Particle displacement vector. 
Si, Strain tensor. 
T,, Stress tensor. 
rt; Traction (stress vector). 
@ Electric scalar potential. 
i; Electric field vector. 
D; Electric displacement vector. | 
ci,,, Elastic stiffness tensor (measured at constant electric field). 
Piezoelectric constant tensor. 
Dielectric constant tensor (measured at constant strain). 
Unit vector normal to, and outwards from surface of body. 
Electric potential on the pth electrode. 
Total charge on the pth electrode. 
Volume of the body. 
Unelectroded area of the body. 
Area of the pth electrode. 
Angular ARMED OY 
= w, 


rn 
eS Bas 
a 8 


8 


8 


ERBRwWSO 


The tensor components above are referred to orthogonal Cartesian 
coordinate axes x; . The comma notation is used to indicate differentia- 
tion, e.g. D;,; = dD;/dx; , and the repeated index summation conven- 
tion is used, e.g., D;,, = Din + Das + Ds 3. 
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II. BASIC EQUATIONS OF A PIEZOELECTRIC RESONATOR 


The equations describing the steady vibrations of a piezoelectric 
body are listed below. 
The equations of motion: 


pru; + Ti;,; = 0. (1) 
The divergence equation of electrostatics (for an insulator): 
D;,; = 0. (2) 
The piezoelectric constitutive relations: 
T= Ciba — talc 5 (3) 
Da = @niSn + Sa, | (4) 
where 
Su = Fu + ua), ©) 
and 
EB, = —¢a, - (6) 
The symmetry relations 
Cijar = Cryin = Chier = Chris (7) 
Cnsg = Cnii » (8) 
Enn = Em « (9) 


2.1 Boundary Conditions 


The boundary conditions for the resonator shown in Fig. 1 will now 
be discussed. . 
On the unelectroded portion of the surface A 


i= 0, on A, (10) 
Dun; = e(E; extn; = 0, on A, (11) 


that is, no surface tractions and zero external electric field exist normal 
to the surface. Note that (11) is an approximation which in practice is 
usually valid for materials with large values of ¢%,/e,. The driving 
electrodes are assumed to be very thin metallic conductors with infinite 
conductivity. Potential 6, and charge Q, exist on the electrode area 
A, . External electrical connections to the electrodes will not be specified 
at present. We pause to note, however, that ¢ = 0 at some point ex- 
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Fig. 1— Arbitrary piezoelectric resonator with N electrodes. 


ternal to the resonator. Since we have neglected the effects of the 
external potential distribution, this ‘“‘earth’”’ point only has significance 
in connection with the topography of the external electrical circuit. 
The latter is assumed to interact only with the currents J, and potentials 
¢@, on the electrodes of the resonator. The mechanical properties of 
the electrode are assumed here to be nonexistent except for a surface 
mass density p,. The surface of the resonator beneath an electrode is, 
therefore, assumed responsible only for exerting a force consistent 
with maintaining the acceleration of the electrode. The boundary 
conditions at the electrode can therefore be written as 


are on A, , | (12) 
¢@ = constant, on A, (13a) 
and either 
¢= 9, or (13b) 
Dan; dA, = —Q, . (18¢) 
Ap 


Note that the choice between (13b) or (13c) as a primary condition 
is unimportant. 


III. PROPERTIES OF SOLUTIONS 


As is well known, the solution of the equations reviewed in Section 
II for a practical case is a formidable problem, and it is often neces- 
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sary to resort to some approximate method of solution. In this paper, 
we will not discuss the methods for solving the equations, but rather 
the nature of the solution assuming that it has been found. 

Equations (1) through (13) can be expressed in terms of u,; and ¢, 
from which it follows that 


prus + Cisnitle,sr + CniiPins = 0, (14) 
CnkiUkin — EnmPinm = O, (15) 
subject to the boundary conditions 
Crit, 12; + Cri3On; = 0, ond, (16) 
Cjx1Ue.1N; — €:.6.4n; = 0, on A, (17) 
with 
¢ = constant on A, (18a) 
and either 
¢ = &, on A, (18b) 
or 
; Din, dA, = —Q,, on A, (18¢) 
and 
Crete, My + CxiihyxNy = psi , on A,. (19) 


We now note that (14) through (19) become homogeneous when ®, = 0 
for all p. This latter condition represents one of the eigenvalue problems 
associated with Fig. 1, namely, that of mechanical vibrations possible 
when all electrodes are connected directly to the reference point. Other 
eigenvalue problems associated with Fig. 1 include those where some 
electrodes are open-circuited (Q, = 0) and the remainder are short- 
circuited (, = 0) (i.e., connected to the reference point). 


3.1 Reciprocal Theorem 


Consider two solutions of (1) through (18) denoted, respectively, 
by (\’, ui, ¢’) and (X”’, uw’, d’””). The two solutions could be, for example, 
those associated with two different sets of forcing parameters at dif- 
ferent frequencies. 

From (1) we have 


/ pata aR & / wT"! , dB = 0, (20) 
B B 
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and from (2) 
[eDe.aB=0, | (21) 
B . 


where B is volume of the body exclusive of the electrodes. Using the 
divergence theorem, (20) may be written 


/ oN ata aR = / SLT! dB + i; ulT!'n; dA =0, (22) 
B B A 
and (21) 
2 i) $';D!' dB + | $'Di'n, dA = 0. (23) 
’B A 


Subtracting (23) from (22), and substituting for the surface condi- 
tions given by equations (10) through (13) we have 


N N 
| | pr\""ulut’ dB + >> / pr”ulul’ dA ~— >> &Q? 
B p=1 YAp p=1l 
= | (4st, — E:DY) dB. (28) 
B 


Equation (24) is still valid when the primed and double-primed quan- 
tities are interchanged. Using this fact, we have 


—— ON N 
Oi rf | purur’ dB + >) [ puis’ aA | — Do (GQ; — 85/Q)) 
; B p=1 Ap p=1 
= | (crust, — BLDY) — (11,84 — BYD))aB. (25) 
B 
The quantity on the right-hand side is zero by virtue of the con- 


stitutive equations (3) and (4). 
Equation (25) then becomes 


N 
QO” = NV (wii!) = DE (@Qs" — 5/05), (26) 
where | 
N 
Vulut!) = / wie S / pulul’dA. (277) 
B p=1 “Ap 


Equation (26) is a special case of the reciprocal theorem given by 
Lewis’ and discussed by Love’ for the purely elastic case. 
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3.2 Orthogonality of the Eigensolutions 


Consider two eigensolutions, (\™, ui”,.@™) and (”, ul”, 6°”) 
of the same homogeneous boundary problem. That is, 6°” = 0 if 
oe” = 0andQg™ = 0if Q™ = 0. Thus, for two solutions of the same 


elgenset 


N 
> (amar — aa) =o, (8) 
p=l1 
and from (26) 
Veuyrus”) = 0, AM AA™, (29) 


Thus, two solutions of the same eigenset satisfy the orthogonality 
condition (29). Also we have from (24), the Rayleigh quotient for the 
eigenvalue \” 


2 f Hus,e) aB 
= B 


™ = we? = — Var umy ? (30) 
where 
Hu; , >) = (T5853 — H;D,). (31) 


3.3 Hxpansion in Terms of Eigensolutions 


The solution to the inhomogeneous boundary value problem in- 
dicated by Fig. 1 can be expanded in terms of any of the sets of eigen- 
solutions. These expansions are very important when electrical behavior 
is of prime interest. We will show here how the forced vibrational 
solution (A, u;, ¢) may be expressed in terms of two of the possible 
expansions, namely: (2) the eigensolutions (A*”, u3™, ¢°™) which 
correspond to the normal modes of vibration of the resonator with all 
its electrodes connected to the reference point, and (iz) the eigen- 
solutions (A°™, u?, 6°“) for the normal modes with all the electrodes 
open circuited. 

For expansion (2) we set 


u; = ul? + > aa (32) 
and 
d= ¢ ae os arg (33) 
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and for expansion (72) 


U; = ue abe = Hage (34) 
n=1 
and 
d = a? a > ba (35) 


where (u‘”, ¢) is the solution to the boundary value problem of 
(1) through (13), as \ — 0. 

Since we have not specified the means by which the electrodes are 
connected to the external electric circuit we will allow the parameters 
, and Q, to be of the general form 


®, = &;° exp (jul), Q, = Q;° exp (jut), (36) 
@,” = | 5° | exp (76,), A = | Q)” | exp Gy,). (37) 


Although it is immaterial how the charges Q, and potentials @, are 
set up in relation to the external circuit, Q, and ®, are of course not 
independent. 

The coefficients a in the first expansion can be found by noting 
that the equations of motion (1) and boundary conditions (12) require 


prus? = p DAS — raul, — in B (38) 
n=l 
and 
parur? = p, > AS™ — NYaMuZ™, on A,. (39) 
n=1 


On multiplying (88) and (89) by u3“ and carrying out the indicated 
integrations and adding we have: 


N 
| praurrus’” dB + Del prAurrus'” dA, 
B p=1 Ap 
e) N 
_ Sarre aff mtrar an + $f vata as] 
B p=1 4Ap 


(40) 
We note from (27) that (40) may be written 


AV ulus) a oS ar onr@ = d) Va ae): (41) 
n=1 
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From the orthogonality condition (29), all terms on the right are zero 
except the term in a”, giving 
a™ = » Viuzrrui ™) : 
~ Som) _ 4 Vus us”) 
By a similar argument we have for the coefficients in the second ex- 
pansion 


(42) 


b'™ = a ee Vurug ey . 
_ oe = d) Viul ul”) 

Remembering the definition of u\® , uS‘” 
ing as a consequence of the reciprocal theorem (26): 


(43) 


and u?‘” we have the follow- 


t 


N 
er Va a _ Ee nase 44 
( ) yD Q (44) 
and 
N 
ROP Vege) = ap Oa. (45) 
p= 
since 


es™ = 9 and Q?™ =0. 
Using (44) and (45) with (42) and (48) 


N 
XD Bs Qn™ 

— —- pie hans DS i. ne ce 

a Oro = yee” V(us us”) 


=» Darage 
From (46), we see immediately that the contribution of the S(m)th 
mode (eigensolution) in the expansion (32) and (33) is dominant when 
41— *™) if 6 is held constant with frequency. We also note that 
the amplitude of a‘”’ depends on the charge on the electrodes when 
the resonator is executing free vibrations corresponding to the S(m)th 
mode (i.e., with all electrodes short-circuited). 

Equation (47) shows similarly that the contribution of the O(m)th 
mode is dominant in the expansion of (84) and (35) when \ — \°™ 
if Q‘° is held constant. Also the amplitude b‘”’ depends on the po- 
tentials on the electrodes when they are open circuit with the resonator 


(m) 


(46) 
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executing free vibrations corresponding to the O(m)th mode. When 
applying the expansions (7) and (72) it should be realized that assump- 
tions have been made concerning the completeness of the eigensets. 


IV. THE ELECTRICAL ADMITTANCE AND IMPEDANCE MATRICES 


4.1 General Considerations 


The admittance matrix y,, for the N-electrode piezoelectric resonator 
is defined by 


N 


Lo Oe pes (48) 


q=1 


Similarly z,, , the impedance matrix, is here defined by 


N 
= Dale + B, (49) 
q=1 
where 
I, = jwQ, (50) 


and RF is a constant depending on the external circuit configuration. 
The relationships (48) and (49) are postulated on the basis that the 
equations of the resonator are linear and that their use is restricted Ma 
steady vibrations. 

We will now derive various properties of y,, and z,,. First we ae 
from (13c) and the divergence theorem that 


N N N 
Dis = Jo 2s Q, = jo a Dan; dA, = —ijo | D;,,dB = 0. 
p=l p=1 p=l1 VAp B 

(51) 


Equation (51) is simply Kirchhoff’s current law, for the conservation 
of charge. We note from (48) and (51) that 


Ds Yade = 0; (52) 


p=l1 q=1 


and since ®, is arbitrary, the sum of each column of the y,, matrix is 
Zero, 1.€., 


N 
Dy You = 0. (58) 


We will now use the reciprocal theorem to show that both y,, and z,, 
are symmetric. Consider the solutions for two sets of potentials 
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and #/’ having the same frequency. Then from (26) 


N 
Dd (810%' — £1/Q) = 0. (54) 
p=1 
Using (48) and (54) 
N oN 
Dy Ly WosFs Pe’ — Yoo®s’Bs) = 0, (55) 


and (49) and (54) 
jo 3 > (2p¢Q1Q5" ~ ZQ’Q5) + R Ds (Q,-— Qi’) =0. (56) 


p=1 q=l 


Since & and #/’ are arbitrary in (55) we must have 


Yoa = Yor - (57) 


N 
In (56) Q/ and Q! are arbitrary and >) Q, = 0, so 
p=l 


@pqa = Rap + (58) 
As a consequence of (57) and (53) 

N 

Die =O: | (59) 


q=1 


It has been shown in this section that the impedance and admittance 
matrices of an N-electrode piezoelectric resonator have properties 
similar in many ways to those of an N-terminal passive electrical 
network.’ 


4.2 Expansions for the Electrical Parameters 


We may now make use of the eigensolution expansions of (32)—(83) 
and (34)—(35) to inquire into the admissible forms for y,, and 2,, as 
functions of frequency. 

For expansion (a) 


Q, = Qi” ses 25 Goo (60) 


= & + Varar™ = 6 all 68 being zero. (61) 
m=1 
Then using (46) and (60) 


op SS WSL? 
Op = + De 2 ase — yy (62) 
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where 


S(m) FHS (m) 
absues Q, Q7 (63) 


pa vg Ss S : 
N ™ Vy ally) poe) 


For expansion (b) 


Q, = A + vO” = Qs, all QP being zero, (64) 


m=1 


b= B+ Doma. (65) 
m=1 
So using (47) and (65) 
N Ca) AFM (0) 
&, = bh” — —— Pte 66 
P > Dee ar OS — d) ( ) 
where 


O(m) xO (m) 
Pb, Pa 


PSS = 
pa o O 
et) Viug as 2) 


(67) 


We now define the charge-potential relations for the solution of the 
static boundary value problem (A = 0) as follows: 


N 
ee DOC. o.* (68) 
q=l1 
and 
N 
b= DEQ +R. (69) 
q=1 


It is assumed, from now on, that the static parameters such as C7, 
are such that quadratic forms like C/,®,8, are positive definite. Proof 
of this depends on energetic considerations. 


We now put 
°™ = wy, and AX” = wen , (70) 
and define 
Coe = Che 2 Cn (71) 
m=1 


We obtain from (62) and (71) the admittance matrix of (48) in the 


ELECTRICAL BEHAVIOR OF A PIEZOELECTRIC RESONATOR 1893 








form 
; 0 2 go 
a Ce? oes pa 5. 9 
Ya iad oa pe (ie oe) (72) 
From (66) and (69) the impedance matrix of (49) is of the form 
— Ll pw _ % os \ 
@pq = jw Fs p> (com _ w’) (73) 


Restricting our interest for the moment to an element y,, of the 
admittance matrix, we observe that the form of (72) is analogous in 
form to the admittance of the electrical network in Fig. 2. However, 
from (59), which is valid for all frequencies, we require 


N 

oS Ce S20, (74) 
but from (63) 

Ce 0; (75) 

so 

N 

De Ci eG: (76) 

aq=1,a4p 


Therefore, several elements of C{”” may be negative. 

The form of an element of the impedance matrix (72) suggests an 
analogy with the circuit of Fig. 3 but, in view of the proceeding dis- 
cussion, it is again probable that several of the elements F{” are negative. 
It should be noted, however, that F5” > 0. 


4.3 Driving Point Functions 


From (49) the driving point impedance Z,? , at the two terminals 


pa 


p-q, when all other terminals are left open-circuit is given by 


(ar Gc ba om 


w; We Ym 
‘ é sa ie} — = 
FOR YnasCo=Cpq? m= cm AND Wm = Wpm 


Fig. 2— Electrical network for admittance representation. 


1894 THE BELL SYSTEM TECHNICAL JOURNAL, OCTOBER 1967 


Cy C2 Cm 
Co 
ee ae 
i—> @ co Om | 
—— a eee 


FOR Zpqr Co = 1/F OS), Cy = 1/ FAD) AND Gm = Wa 


Fig. 3— Electrical network for impedance representation. 


I 


Zea (®, — ,)/I, (77) 


= 2pp 224 or Zaq + 


In terms of the expansions for z,, given by (73), we have from (77) 


where 
fe ES Oy oS 
NO (12 y2™) 
and 
PO = FD — OPO + FY. (80) 


So, clearly F‘’” > 0 and therefore, the analogue circuit of Fig. 3 is 
“physical” for 7? . 

At first sight, it would appear that one could easily derive a driving 
point admittance for the p-q port when all other terminals are shorted. 
In fact, it must be found by appropriate manipulation of either y,, 
Or 2, and, in general, the expression includes many elements of either 
matrix. We can, however, define a driving point admittance for the. 
p-terminal, when all other terminals are connected to the reference 
point, i.e., 


Yy = Yoo + (81) 
The analogue electrical circuit for y,, , namely Fig. 2, is again physical. 


4.4 “Black Box’ Matrices for a Two-Port 


Before calculating any ‘‘black box” transfer matrices it is convenient 
to define a transformed admittance matrix valid for the resonator and 
its external circuit. In Fig. 4, the terminals of the resonator are all 
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interconnected, there being a physical component with admittance 
y;, connected between terminals p and g. The currents flowing into 
the N-terminal network and resonator as a whole are 


N 
LS es (82) 
q=1 
where 
Ube = Ver Yeas (83) 
and 
N 
E E 
Yoo = 2D Yoa + (84) 
a=l,ax¥p 


Yoq 18 defined by (48) and y,%7 = y2% , as can be seen from Fig. 4. We 
may now form two-port networks. For the purposes of further dis- 
cussion, any connections made externally to the two-port will be as- 
sumed to be consistent with 


f==-IL, f= -l,, V,= 4 — &,, Ve= & — ©. (85) 


4.5 Electrically Symmetric Two-Port Resonator 


Further discussion, with all y,” finite for the N-terminal resonator, 
will not be continued. The reduction of a N-terminal network to a 
2-port is discussed by Weinberg.’ 


4.5.1 T’wo-Port With N-2 Terminals Shorted 


We will now consider a simple case where all terminals except p 
and q are connected directly to s, and an admittance y," is connected 





Fig. 4-— External electrical connections to the resonator. 
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between p and q as shown in Fig. 5. We will also assume that the con- 
struction of the resonator is such that it is electrically symmetric 
with respect to the ports. We have under these circumstances 


LS eo Uae (86) 

Le = eel oe UV es (87) 
where 

Yoo = Vow + You (88) 
and 

Ye = You ~ You (89) 


We now consider the electrical lattice network of Fig. 6 as an analogue 
of transfer characteristics of (86) and (87). The analogue (I'ig. 6) is 
physical if Y, and Y, are realizable with physical components. 

Now 


Y, 


I 


Yor — Yona» (90) 
and 


Y, = Yon as Yoa + (91) 


Using (72) and (63), Y, and Y, can be expressed in terms of the eigen- 
solution expansion as follows: 





ice = ce 2 IM ae™ = gemaemy 

Y,= ial (CS? = Cae ps 8 — 0) POs mus) » (92) 
ce a. coer, 4, GEM OEM + GEM OEM) 

Y, = ial (CSD — Cy) a p> (yS™ =) Vue mys”) ° (93) 








Fig. 5 — Two-port system for N-2 electrodes short-circuited. 
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Fig. 6 — Electrical analogue of Fig. 5. 


Also since we have taken y,, = y,, , then from (63), either 


QF = QF" or QE” = Qe. (94) 


It therefore follows that the S(m)th eigensolution may only contribute 
to one of Y, and Y,, depending on sign of Q3‘”/Q?‘”. We also see 
that the electric circuit of Fig. 2 is a physical analogue for both Y, 
and Y,. 


4.5.2 Two-Port With (N-4) Terminals Open Circuit 


The symmetrical resonator with (N-4) terminals open circuit is 
shown in Fig. 7. We now use the z,, matrix of (49), (67), and (73) to 
derive the impedance matrix of this two-port, again subject to the 
restrictions of (85). We find that 


Vy = Zep] + Zoal (95) 

Ve = Lgl, + Zod, ; (96) 
where 

Lop = Spy PSs. ey (97) 
and 

Zoe = ®pa t ar — Spr — 2aq + (98) 


From (82) and (67) 


C-) (m) 
Zoq = = | ase -> a | (99) 


(a) nel (\? (m) 


SS 
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Fig. 7— Two-port system for N-4 electrodes open-circuited. 


where 
Gp = Fy + Fi — OF, (100) 
O(m) __ gO (m)\2 
Gp = Be 0 - 0 J ’ (101) 
Ven Vu; ae ) 
O(m) _— Om) (q~O(m) _— FO (m) 
Gi = (&, 2 )(&, ®, ) i (102) 
r m) Val ul) 
Goo = FEO 4 PLO — PO — FY, (103) 
Also since the resonator has been taken to be symmetrical, i.e., 
Log: Leis (104) 
it follows, from 67, that 
(G5 OF) Soe oO.) (105) 


If we represent the transfer equations (95) and (96) in terms of the 
lattice analogue of Fig. 8, subject to the restrictions of (85), we have 


Z = 1) ae = Go? 
a je PP Pa 





SIR a S| ee ie 09] 
Qe a d) Vana) 


| | 
j= Ms 
~ 
° 
g 








(106) 
2 = 7 | (G2 + Gly 
00 N (eo = oe.) (ee _ pom ni pom) = p(™) 
= pa x0 oe oem — ) VEE ue?) | 


(107) 
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Za 


Za 
Fig. 8— Electrical analogue of Fig. 7. 


We note by virtue of (105) that the O(m)th eigensolution only 
contributes to one of Z, and Z, depending on the sign of (®)°” — 6?) 
/(6?’ — 6°). Tt also follows that each of Z, and Z, have the circuit 
of Fig. 3 as a physical analogue. 


V. DISCUSSION AND CONCLUSIONS 


It has been shown how the electrical behavior of a piezoelectric 
resonator with N electrodes, represented by an admittance or im- 
pedance matrix, can be determined from the eigensolutions for free 
vibrations of the resonator. The results for the admittance obtained 
by Lewis* are contained here as the special case N = 2 in (72). The 
impedance of the two-electrode resonator is described by (73). This 
result was not given by Lewis’ since he did not consider the alternative 
expansion of the open circuit eigensolutions. It could be argued that 
since impedance is simply the reciprocal of admittance, residues of 
one could be found from the other. This would, however, involve 
rather cumbersome calculations. Furthermore, if an approximate 
theory is used to generate the eigensolutions, the expansions may only 
be valid in a small frequency range, thus making the calculation of, 
say, the residues of the impedance from the admittance expansion 
impossible. 

Returning to the general case of the N-electrode resonator, the elec- 
trical behavior can be predicted in terms of the admittance or im- 
pedance matrices of (72) and (82). If external electrical components 
are to be connected between the N electrodes, and a two-port com- 
posite network is to be formed, its transfer characteristics can be 
deduced from either matrix. In the particular case of the symmetric 
resonator with N-2 of its electrodes connected together, the admittance 
matrix provides simple results, whereas the impedance matrix is use- 
ful for the case of N-4 open-circuited electrodes. 

Before concluding, it might well be asked what reasons there are 
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for preferring a motional parameter representation of the electrical 
characteristics over the direct method of determining the impedance 
or admittance matrices from the solution to the inhomogeneous bound- 
ary value problem. These are essentially twofold. Firstly, if a two- 
port N-electrode reasonator is to be designed to realize some transfer 
function or driving impedance, or so on, an appropriate synthesis 
procedure will usually automatically yield these motional parameters, 
leaving only the task of physically realizing a resonator with these 
parameters! Secondly, if the inhomogeneous boundary value prob- 
lem is being solved, the inevitable numerical calculations are least 
likely to be accurate in just those ranges which are of prime interest, 
namely, the poles and zeros of the admittance or impedance matrices. 
Furthermore, considerable computing time would be lost, compared 
with the motional parameter method, if some transfer characteristic 
of the derived two-port were to be obtained for a large number of 
frequencies in a narrow band. 
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Properties and Device Applications of 
Magnetic Domains in Orthoferrites 


By A. H. BOBECK 
(Manuscript received August 15, 1967) 


It has been shown that tsolated magnetic domains in thin platelets (2 
mils thick) of orthoferrites can be manipulated to perform memory, logic, 
and transmission functions. The purpose of this paper ts to discuss 
the properties of orthoferrites that make them suitable for magnetic device 
applications and consider magnetostatic problems relevant to domain 
structures found to be useful. Included is a brief indication of how memory, 
logic, and transmission can be accomplished; however, the details will 
be reserved for a later paper. 

The stability conditions of a cylindrical domain are discussed in detail 
and data is reported to support the conclusions. Of particular interest are 
the sizes of cylindrical domains available in the various orthoferrites. Such 
data has been taken on five of the fourteen possible orthoferrites and tt is 
found that the thuliwm orthoferrite, TmFeO;, gives the smallest stable 
domain diameter (2.3 mils) and LuFeOs; the largest. The stability results 
lead to a direct method for obtaining ow , the domain wall energy density. 
For TmFeO; , as an example, ow = 2.8 ergs/cm’. 

It is concluded that the orthoferrites are well suited indeed for device 
applications. Experimentally, 3 mil diameter domains have been manip- 
ulated and there is every reason to believe that operation of sub-mil domains 
will soon be realized. 


I. INTRODUCTION 


Recently, P. C. Michaelis described a technique for propagating 
isolated magnetic domains in thin anisotropic ferromagnetic films.* 
He obtained controlled motion along either the easy (e) or hard (h) 
anisotropy axis although he used distinctly different mechanisms to 
obtain propagation in these directions. 

Michaelis’ ferromagnetic films were processed to have a uniaxial 
anistropy (i.e., hard and easy axis) in the plane of the film. Magneti- 
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ANISOTROPIC 
NiFe FILM 


Fig. 1— An isolated magnetic domain can be moved along the easy (e) or 
the hard (h) anisotropy axis. 


zation lies in the plane of the film and a magnetic domain, as illus- 
trated in Fig. 1, is seen to be an isolated reverse magnetization area 
bounded by a domain wall. The disparity in the propagation modes 
for the e and h directions is due to anisotropy inherent in the film 
itself. Propagation of domains along the diagonals is possible using 
conventional wall propagation and, in fact, Spain? has recently dis- 
cussed such a technique. 

Complete generality in the propagation (and interactions) of mag- 
netic domains, however, demands that the magnetization be aligned 
normal to the surface of the film. Furthermore, it would be useful if 
the magnetic properties were isotropic (or essentially so) in the plane 
of the film. A cylindrical domain in such a material is drawn in Fig. 2. 
These conditions are met in orthoferrites as first pointed out by R. C. 
Sherwood. Other similar materials are magnetoplumbite, barium fer- 
rite, and manganese bismuth. 

This memorandum will describe some of the results of generat- 
ing and propagating cylindrical domains. Related magnetostatic 
problems are introduced and discussed. It will be shown that the 
stability conditions for cylindrical domains lead to a method of 
determining wall energies and results obtained so far on orthoferrites 
are tabulated. Finally, a brief description is given of some of the 





Fig. 2—An ideal material permits magnetization normal to platelet surface 
and is otherwise isotropic. 
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device properties; however, this aspect will be treated in much more 
detail in a later memorandum. 


II. ORTHOFERRITES 


An excellent treatment of orthoferrites can be found in Ref. 3. 
Orthoferrites of the general formula MFeO3, where M is any rare 
earth ion are antiferromagnetic with a weak ferromagnetism caused 
by a slight canting (0.5°) of the antiparallel spins. The molecular and 
magnetic unit cell is an orthorhombic cell of sides a, b, and c with the 
c side about twice the length of a or b, as illustrated in Fig. 3. The anti- 
parallel Fe 3+ spins align along the a-axis with the c-axis exhibiting 
the weak ferromagnetism (47M, ~ 100 gauss). The lone exception is 
SmFeO; which has its net moment along the a-axis at room tempera- 
ture. The Néel temperature for all orthoferrites is about 400°C. 

The orthoferrites have a remarkable set of magnetic properties. 
When magnetized to saturation, fields of several thousand oersteds are 
needed to effect a flux reversal. This field (nucleation field Hy) is an 
order of magnitude greater than the magnetic moment of a typical 
orthoferrite. Thus, platelets having the c-axis normal to the planar 
surface are magnetically stable without an applied field even when 
fully saturated. Furthermore, once domain walls are present they can 
be moved with fields (wall coercivity H,) less than one oersted. Thus, 
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Fig. 3— Locaton of Fe3* spins in typical orthoferrite orthorhombic cell. 
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the orthoferrites are ideally suited for device applications which utilize 
materials with a reentrant B-H hysteresis characteristic. 

Orthoferrites can be grown from a PbO flux and all of the speci- 
mens evaluated during the course of this work were so prepared by 
J. P. Remeika and L. J. VanUitert. Occasionally, a particular run will 
yield nearly perfect single crystal platelets of orthoferrite. Dimensions 
vary with 0.1 inch by 0.2 inch by 2 mils thick being typical. For- 
tunately, most of these platelets grow with the c-axis normal to the 
plane and are thus ideally suited for cylindrical domain observations. 

The tendency to grow platelets is not characteristic of all orthofer- 
rites. For this reason, it has been necessary to develop techniques 
for preparing platelets from larger crystals. 

Orthoferrites are optically transparent, especially to the red spec- 
trum. Magnetic domains in a thin platelet (2.3 mils thick) of TmFeO; 
are readily seen using the Faraday rotation of transmitted light. Note 
in Fig. 4 the random orderliness of the areas magnetized up (dark) 
and down (light). Magnetostatic and wall energies balance to de- 
termine the general shape as well as the dimensions of the domains. 
It is only with a great deal of reluctance that these domains yield 
to an inhomogeneous field. 


III. MAGNETOSTATICS AND STABILITIES OF STRIPS AND CYLINDERS 


Assume, as pictured in Fig. 5, that a loop of wire is placed in 
contact with the surface of an orthoferrite platelet. When a current 





X16 


Fig. 4— Faraday observation of magnetic domains in TmFeQOs. Note the 
isolated oval domain. 
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(a) BEFORE (b) arter 


Fig. 5 — In the process of generating a cylindrical domain a current applied to a 
loop alters the static domain pattern of (a) to that of (b). 


is applied to the loop the resulting field pattern will perturb the exist- 
ing domains. Areas of magnetization seeing a favoring applied field 
will grow — others will shrink. To produce a cylindrical domain, note 
that it will be necessary to “pinch off” at several points. To maintain 
a cylindrical domain when the applied current, I, is removed, it is 
usually necessary to apply a dc bias field normal to the surface of the 
platelet. The conditions under which the cylindrical domain is stable 
will be analyzed in this section. 


3.1 Magnetic Strip Domain 


To a first approximation, the field necessary to “pinch” a strip 
domain to produce a cylindrical domain can be equated to the field 
required to compress a strip domain to zero width. The strip resists 
compression since a high magnetostatic energy state is generated. The 
magnetostatic field effective when the strip approaches zero width 
can be obtained by inspection and is 47M, (see Fig. 6). One im- 
mediately sees the significance of the low moment of the orthoferrites 
since the applied fields necessary to generate cylindrical domains will 
be directly related to the magnetic moment. 





at 6— The internal magnetostatic field in a uniformly magnetized platelet 
is Ar ae 
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Fig. 7— Normal magnetostatic field component generated by a strip of mag- 
netic charge is related to the angle @. 


The relationship between the width W of a strip domain and the 
applied field H, will now be discussed. (Refer to Fig. 7.) Consider a 
strip of magnetic charge located in the xy plane and extending to 
infinity in the +y and —y directions. The magnetic field component 
perpendicular to the zy plane, H,, is directly related to the angle 6 
subtended by the strip and is given by H, = 2M,6. (A similar rela- 
tionship exists for a strip carrying a uniform current.) 

Consider a strip domain of width W in an orthoferrite platelet of 
thickness h. This case is illustrated in Fig. 8. By means of the angle 
relationship discussed above the z-component of field, produced by the 
magnetic surface charges, can be quickly obtained. For example, the 
field at the midpoint of either domain wall (sides of the strip) is 





Fig. 8— The applied field necessary to sustain a strip domain is derived in the 
text. ; , 
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H(z = h/2) = 8M, tan’ h/2W. (1) 
The polarity of this field is such as to cause the strip to widen. An 
applied field, equal in magnitude and opposite in sign, is, therefore, 
needed to maintain the strip at a width W. 
More significant is the average field effective on the walls under the 
assumption that the walls are rigid (do not bulge outward). In 
Appendix A the desired relationship is derived and is 


ey a ane 
iM | tan (2) a, (1 + lm @) 


A similar expression is found in Kooy and Enz.* Equation (2) is 
plotted in Fig. 9. Note that as the strip narrows (W — 0) the normal- 
ized field approaches unity as discussed previously. For a very wide 
strip the field effective on the wall tends to zero and the walls are 
stable in position without an external applied field. 

Equation (2) tells us to what extent the surface charge of the 
strip and the sheet cancel. As W increases, the walls see cancelling 
fields from the surrounding magnetic charge and the wall field re- 
duces to a low value. If the strip is driven closed, however, the 
quantity of nullifying charge on the strip itself goes to zero and the 
full internal field 47M, is felt. If the wall coercivity, H,, is very low 
only very wide strips (W>h) will be stable in the absence of an 
applied field. 





3.2 Cylindrical Magnetic Domains 
In the derivation of equations which related to the performance of 
a strip domain, it was not necessary to include a domain wall energy 


“MAGNETOSTATIC” 
_ WALL FIELD 
~~ FOR STRIP 





W/h 


Fig. 9— An applied field of 47M, is necessary to collapse a strip to zero width. 
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term since the walls maintained constant area. Such is not the case 
for a cylindrical domain. 

Consider as shown in Fig. 10 a cylindrical domain of radius r in a 
platelet of orthoferrite of thickness h. Assume that the domain wall 
which defines the cylindrical domain has straight sides, i.e., the shape 
is neither “hour-glass” nor “barrel” like. The total energy, relative to 
a uniformly downward magnetized platelet can be written as 


Ep(total) = éy(wall) +é (magnetostatic) + &, (applied) 
or, 
Ep = 2arhoyw — Ep + 2M .H arth, 
using CGS units where the domain wall energy density ow is in 


ergs/cm?. The partial derivative of the energy with respect to r gives 
the force on the wall. 


O&r 
or 


It is assumed that dow/or is negligible. In terms of fields, 


=e oe pea AT The 


0&7 /Or Ow 0£,/dr 


Gish Gi Gite te (3) 


(1) (IT) UII) (IV) 

Equation (3) is a stability relationship in terms of magnetic fields. 
It relates the net effective field on the wall (I) to the wall field (ZZ) 
trying to compress the cylindrical domain, the demagnetization field 
(III) trying to expand that domain, and the applied field (JV). If the 
net field (IZ) is positive the domain will compress, if it is negative it 
will expand. 

It is convenient to express the “wall energy” and “magnetostatic” 
contributions as fields. Term (JJ) can be equated to a wall field, 











Fig. 10 — A cylindrical domain in a platelet of thickness h. 
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termed Hyy, since it is the field contributed by the wall energy density, 
ow. So 


Hy = eit (4) 
Note that this field, which goes as 1/r, is the eventual cause of the 
collapse inward of the smallest domains. The importance of (4) which 
is plotted in Fig. 11 cannot be overstressed since it points out the 
significant role that wall energy will play in orthoferrite devices. 

A. Thiele® has obtained an expression in closed form for (III) the 
magnetostatic field which we shall designate as Hp. A derivation of 
the magnetostatic energy ép is not readily obtainable. An alternative 
derivation, somewhat simpler but less rigorous than the technique 
employed by Thiele, is presented in Appendix B. The result (by either 
method) is 








Hy _ 2) _ 2% Td a DEU: 
re ie at + V1 + Gr, r/?) | (5) 
where E(k, x/2) is the complete elliptic integral of the second kind 


and 


ke? = eet ees xt 

1+ (h’/419) 

The normalized magnetostatic field plots much (see Fig. 12) as the 
strip field of Section 3.1. The sense of Hp is always to attempt to 
expand the cylindrical domain. Equation (5) is plotted in detail in 
Fig. 24. 


“WALL ENERGY” 
_— WALL FIELD 
FOR CYLINDER 





er/h 


Fig. 11— The wall energy, ow, generates a field of a sense to collapse a cylin- 
drical domain. 
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“MAGNETOSTATIC” 
_— WALL FIELD 
FOR CYLINDER 





er/h 


Fig. 12—- The magnetostatic energy generates a field which attempts to expand 
a cylindrical domain. 


It is probably well to reiterate that the fields H4, Hw, and Hp 
are assumed to be acting on a rigid cylindrical domain wall and Hw 
and Hp have no significance unless applied to the domain wall itself. 


3.3 The Stability of a Circular Domain 


We are now in a position to discuss the stability of a cylindrical 
domain. Three cases will be considered. They are (2) r < h, a very 
thick platelet, (77) r >> h, a very thin platelet, and (272) r ~ h, a “just 
right” platelet. Case (7) will be magnetostatic energy dominated, case 
(iz) wall energy dominated, and case (cz) will have these energies 
somewhat in balance. 


3.3.1 Very Thick Platelet 

For a cylindrical domain of radius r in a platelet of thickness h, 
where r < h, the magnetostatic field Hp of (5) may be approximated 
as 47r//,. In this case the critical radius, 7,, is obtained by equating 
the magnetostatic field to the wall field. This is graphically done in 
Fig. 13. Analytically, 





apt See We ot 
40M. = 2r,M, 
and 
lo pa mere (6) 


For typical orthoferrites, r, 1s the order of 0.5 mil. Domains of a 
radius greater than 7, will expand uncontrollably while those of radius 
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UNSTABLE 
EQUILIBRIUM 
/ 


7 





re ee 


Fig. 13 — Metastable equilibrium exists with “very thick” platelets. 


less than rg will contract to oblivion. The significance of rg is that it 
gives an absolute lower bound on the stable domain size. Note that 
the problem of initially establishing a domain of radius rz has been 
carefully avoided. 


3.3.2 Very Thin Platelet 

At the other extreme are the conditions that exist in a very thin 
platelet, ie.,r >> h and the wall field completely dominating the mag- 
netostatic field. Note Fig. 14. In the absence of a coercivity the criti- 





Pc 
<~-——0 


c—> 


Fig. 14 — Stability condition for a “very thin” platelet. 
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cal radius r, will approach infinity. With a wall coercivity H, the 
minimum stable radius can be obtained from 


ow 
ar 2r.M, 
or (7) 
pea 
s 2M,H, 


For typical orthoferrites, r, is 40 mils. Indeed, difficulty is experienced 
in generating domain patterns of any kind in very thin (less than 
one mil) platelets. 


3.3.3 ‘Just Right” Platelets 

We have seen that cylindrical domains in very thick platelets are 
completely unstable. Also, that in very thin platelets only excessively 
large domains are stable and then only if some form of wall pinning 
is assumed. 

A stable cylindrical domain can be obtained if the thickness A of 
the platelet is chosen so that at the point of intersection of the Hp 
and Hy curves 


OH y : 


or 


| ote (8) 


or 











In general, a bias field (H,;.,) is needed to secure the intersection at 
“6” which satisfies (8). Refer to Fig. 15 and note that an Hy + Ay;,,curve 


UNSTABLE 
/ 


Hp 
je 
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Fig. 15— A stable cylindrical domain can be obtained if a suitable bias field 
is applied. 
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is plotted. For a radius somewhat greater than r, the combined wall 
and applied fields close the domain to r,. If the starting radius is less 
than r, but greater than r* the dominant magnetostatic field will 
open the domain to r, . Any domain of radius less than r* will collapse. 
By adjusting the bias field r, can be varied somewhat. For example, 
in TmIeO, , 2.3 mils thick, r, can be varied from 2.8 mils to 1.2 mil 
as the bias field is changed from 26 Oe to 36 Oe. 

As Hpias is increased, r*# and r, approach each other and become 
equal when the Hy + H,;,, curve is tangential to the Hp curve. With 
a further increase, all cylindrical domains become unstable and col- 
lapse. This leads to a direct method of obtaining the wall energy 
density, ow - 


IV. WALL ENERGY DENSITY, ow 


The cylindrical domain radius r, as a function of an applied field 
Hyias has been measured for a number of rare earth orthoferrites. A 
typical curve of domain radius vs applied field is plotted in Fig. 16. 
This experiment, as was pointed out above, is a direct method for 
obtaining ow the domain wall energy density. 

For the 2.3-mil thick platelet of TmFeO3 described previously the 
minimum r observed is 1.15 mil at an applied field of 36 Oe. Since 
h = 2.3 mils, 2r/h = 1. From Fig. 24, Hp/4c7M, = 0.58. For TmFeO; 
4rM, = 140 gauss so Hp = 81 Oe. Thus, Hw = Hp — Higgs = 81 — 
36 = 45 Oe. Therefore, 


Cr = 2rH MM, 


= 2.80 ergs/em’. 


40 
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Fig. 16 — Cylindrical domain size as a function of an applied bias field. 
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The average results obtained on orthoferrites for which platelets — 
were available are tabulated in Table I. Note that in addition to the 
measured values of rmin that a calculated 2rrq (Section 3.31) is also 
included. Thiele> has shown that 2zr, is the optimum platelet thick- 
ness. Platelets of thickness 277, will sustain cylindrical domains of the 
minimum possible diameter and this diameter will be approximately 
2xr,. Note that TmFeO; has the lowest calculated 2zr, (as well as 
the lowest observed domain size), and is thus a prime contender for 
device applications. It is hoped that some other orthoferrite such as 
YFeO3 which has a reported’ ow of 1.0 erg/cm? and 47M, of 105 gauss 
may give yet smaller domains than TmFeOs3. 


V. GENERAL COMMENTS AND OBSERVATIONS 


The sequence of pictures of Fig. 17 give a pictorial display of much 
of the material described in the preceding sections. An “as grown” 
TmFeOs3 platelet, 2.3 mils thick was subjected first to an increasing 
bias field applied parallel to the c-axis (perpendicular to the planar 
face) and then to a decreasing field. 

The platelet is demagnetized, i.e., equal areas of magnetization up 
and down, if no field is applied (a). The average width of a domain 
is the consequence of a magnetostatic and wall energy balance and 
as such can be used to obtain a crude estimate of the wall energy. It 
is an observation that for very thick or very thin samples the strip 
width is increased from that shown. 

As the field is increased, the dark strips narrow and there is a 
general reshuffling of domains. At (g) one of the “dumbell” domains 
has collapsed into a cylindrical domain. Further increase in field finds 
a total of five such domains (2). At 87 Oe applied field, three of the 
five have collapsed (j) and the remaining two would also collapse if a 
few more tenths of an oersted was applied. However, at this point 


TABLE I 
Observed Calculated 
4rM 6 ow 2rmin Field Thickness 27a 
Orthoferrite (gauss) (ergs /cm?) (mils) (Oe) (mils) (mils) 
HoFeQ; 91 2.0 4.6 12 2.1 3.8 
ErFeO; 81 1.7 6.0 8 2.0 4.1 
TmFeO; 140 2.8 2.4 36 2.3 2.2 
YbFeO; 143 3.0 3.8 59 4.4 2.3 
LuFeO; 119 3.9 8.8 4 1.4 4.3 
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the field was slowly reduced and the two cylindrical domains grew 
in size. At 27 Oe they became unstable as circles and blew out into 
strips (k). The circle to strip to circle process can be repeated with a 
field perturbation of 1.5 oersted. Thus, the wall coercivity, H,, is 
probably less than 1.5/2 = 0.75 oersted. If the field is reduced to zero 
a pattern superficially like (a) appears. 

By passing the tip of a fine magnetized wire over the surface of 
a demagnetized platelet it is possible to “cut” through the strip do- 
mains. Then as the bias field is applied large numbers of “bubbles” 
appear such as in Fig. 18. Our next problem is to look at the ways in 
which these “bubbles” can be manipulated to do logic and storage. 

It will suffice, for the purposes of this article, to just indicate very 
briefly some of the operations that are possible. These are illustrated 
in Fig. 19. Since in the correct environment cylindrical domains have 
been shown to be stable, storage is readily available. Transmission of 
a domain from location 1 to location 2 is achieved by energizing a 
conductive loop located at position 2. The domain, in seeking the 
lowest energy state, readily moves to position 2. To obtain a com- 
plete set of logic functions an interaction is required. The magneto- 
static repulsion which exists between domains ensures that only one 
domain will move into position 2 if that loop is energized. New 
domains can be created by replication. This involves literally tearing 
a single domain into two halves which then expand to full size. The 
use of the full set of operations allows the possibility of data process- 
ing applications. 

It is apparent that a multi-dimensional shift register can be built 
using transmission with a three-phase drive source to achieve the 
desired directionality. Information is inserted by selective replication 
at the input of the register. Most of the early device work has cen- 
tered on the design and operation of multi-dimensional shift registers. 
Initial success was obtained when a 2.2-mil thick platelet of HoFeO3 
was combined with a waffle-iron like high-permeability ferrite base- 
plate® The baseplate consisted of a matrix of 10-mil by 10-mil posts 
positioned on 15-mil centers. Single turn windings were wrapped about 
selected posts and series connected to form the five distinct propa- 
gation phases identified in Fig. 20(f). The waffle-iron posts served to 
facilitate the wiring procedure as well as to precisely define the ap- 
plied field patterns. Assume as shown in (a) that domains exist in 
the orthoferrite platelet at the upper left and middle left 61 locations. 
The location of the domains can be ascertained by viewing the results 
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(b) 16.0 Oe 





(c) 20.6 Oe | | (d) 21.8 Oe 





(e) 2270e (Ff) 2450e 


Fig. 17 — Faraday studies of a platelet TmFeOs; orthoferrite, 23 mils thick, 
c-axis normal to the surface. Sample originally demagnetized (a). Field applied 
normal to the surface, first increased, then decreased. Note at (g) that the strip 
in the upper left-hand corner became a cylindrical domain. This, and the other 
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(kK) 27.0 oe | (L) ZERO FIELD 


Fig. 17 — (continued) 


cylindrical domains which formed, reduced in size until (j) when three of five 


collapsed. As the field is decreased the remaining two bubbles open into strips 
(k) and eventually grow to fill the entire platelet (1). 
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b-AXIS 


a-AXxIS 


xX 16 





Fig. 18— Numerous cylindrical domains produced by “cutting” strip domains 
with a magnetized wire. 


of a magnetic colloid interaction with the domain walls defining the 
cylindrical domains. Application of a current pulse to 62 causes the 
pair of domains to step one post position to the right. In this manner 
the domain patterns of (b) through (d) are generated. The sequence 
123145132154 . . . causes domains to propagate clockwise in the upper 
and lower loops resulting in a residual pattern (e) being generated by 
the colloid. 

More recent work has been directed toward improving the storage 
density of the shift register. Operation with 3.5-mil diameter domains 
has been achieved and there is every indication that sub-mil domains 
can also be propagated. The final storage density of the device ap- 


TRANSMISSION 
INTERACTION 


REPLICATION 





Fig. 19 — Illustration of transmission, interaction, and replication. Shaded areas 
represent cylindrical domains, circles the drive loops, and underlined numbers 
the energized loops. 
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pears to be limited by wiring pattern resolution rather than any 
magnetic property. 


VI. CONCLUSIONS 


A variety of experiments have been performed on orthoferrites. 
These include cylindrical domain stabilities, strip stabilities, magneto- 
static interactions, and device applications. It has been found that the 
concept explained in the body of this memorandum of considering 
the magnetostatic and wall energies as generating equivalent fields, 
is useful toward a first order understanding of the phenomenon ob- 
served. . 

Cylindrical domain stability has been studied in detail. The wall 
energy density ow and the magnetic moment 47M, are seen to be sig- 
nificant factors limiting the minimum available diameter of a cylindri- 
cal domain. With experiments completed on five of fourteen orthofer- 
rites the results have shown that TmFeQOs; has the smallest stable 
domain diameter, 2.3 mils. There is every reason to expect that sub- 
mil domains will be realized in other orthoferrites. 

This paper has discussed idealized, elastic, domains. However, most 
of the early successes in manipulating domains in shift register, logic, 
and memory structvies were accomplished with rather thin, high 
coercivity platelets. Further study will be required to determine the 
optimum blend of operating characteristics. 
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APPENDIX A 


It is desired to derive the average z-component field H, acting on 
the domain walls which define a strip of magnetization reversal of 
width W located in an infinitely large magnetic platelet of thickness 
h. See Fig. 21. Consider the domain wall located in the X = 0 plane. 
Note that the surface magnetic charge of the strip itself and that of 
an image strip also of width W will produce cancelling fields at any 
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(@) BUBBLE STARTING (b) PULSE 2 
POSITIONS 





(c) PULSE ®3 (d) PULSE ©1, THEN 04 


Fig. 20—Sequence of photographs illustrating two-dimensional shifting of 
cylindrical magnetic domains in HoFeOs orthoferrite. Operation was obtained on 
a ferrite waffle-iron baseplate and observed with a 3M Colloid Viewer. 
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(€) RESIDUAL PATTERN (fF) IDENTIFICATION 


IN COLLOID VIEWER OF DRIVE LOOP 
AFTER CONTINUOUS DESIGNATIONS 


123145132154 SEQUENCE 


Fig. 20 — (continued) 


“2”? and thus need not be considered. The field at ‘‘z” due to the upper 
right-hand sheet of charge is 





Hy Law (f) 
‘nM, Qn COW 
Now 
- tf? 
A, == i Hae 


This equation assumes that the domain wall is rigid and therefore 
that the force acting on the wall can be averaged. 
So 





3 ee Cae a) 
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ED pes tl ¥ I. 
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Since four sheets of magnetic charge are acting on the wall (producing 
components identical in field direction and magnitude) the total 
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Fig. 21—JIdentification of parameters used in the derivation of the strip 
magnetostatic field. 


average field becomes 


H, 2 eth Ww h? 

tel, ~ 2 tan (i) — (2 +) 
In the body of the text (Section 3.1) the z-component of field for 
the strip is designated H, and the above expression is entered as (2). 

The assumption that the walls defining the strip domain are rigid 

(and straight) is a good approximation for W > h and poor for 
W <h. In the latter case, the average force acting on the walls 
differs significantly from the maximum forces experienced by the 
walls. It is expected that for W < h the walls will bulge outward. 


APPENDIX B 


The calculation of the average wall field H, produced by surface 
magnetic charge, for a circular magnetic domain proceeds in much 
the same manner as for the strip. As in the strip case the domain wall 
itself is assumed to be rigid. The mathematics are simplified if it is 
recognized that a cancellation cylinder exists as shown in Fig. 22. 
Only one quadrant will be considered with a factor of eight included 
to account for all four quadrants plus a top and bottom. 

In general, as defined in Fig. 23, 
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Fig. 22 — Figure useful in the derivation of the average wall field of a cylindrical 
domain produced by surface magnetic charge. 
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Fig. 23 = Figure showing field H, is related to M, and the angle a. 
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Fig. 24 — Cylinder and strip magnetostatic fields. 
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The integral is the complete elliptic integral of the second kind 
E(k, +/2) where 


Ki = ic 
cl 4r3 
In the text, the magnetostatic field effective on the cylindrical domain 
wall is designated Hp thus the final expression, which is (5), becomes 


Hy _2| 2m, f) 4 aro, | 
Je = 2) oN pee) 


Equation (5) and (2), representing the magnetostatic field of the 
cylinder and strip, respectively, are plotted in detail in Fig. 24. 
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Interpolation of Data With 
Continuous Speech Signals 


By M. R. SCHROEDER and S. L. HANAUER 
(Manuscript received March 1, 1967) 


In some communications systems, the need arises for temporally 
interpolating data or signalling information during continuous speech.’ 
If the required time gaps are created by simply interrupting the speech 
signal, severe degradation of speech quality and some loss in intel- 
ligibility results. 

The reason for the degradation is twofold: 


(¢) The interruptions introduce discontinutties in the speech signal— 
two for every interruption. 

(77) The interruptions, unless occurring pitch synchronously, create 
an inharmonic signal. 


In the following, a proposal is described which avoids discontinuities 
and is pitch synchronous—without the need for pitch detection. Average 
“off-time” ratios of 30 percent have been achieved for continuous speech 
without audible degradation. These results were obtained by computer 
simulation of a sampled data system. The instrumentation for a real- 
time analog system is simple. 

The gaps created by this method occur at irregular intervals in time. 
Thus, for a steady flow of data or signalling information, some buffer 
storage and coding that distinguishes “gaps” (interpolated data) from 
speech is required. 

In the proposed interpolation system, the speech signal s(¢) is divided 
by its envelope 


a(t) = [s*() + SH}, (1) 
where §(f) is the Hilbert transform’ of s(¢). The resulting signal is 
then multiplied by a modified envelope 


3 af)—c if a>e 
a(i) = (al) - }. = . (2) 
0 if ase, 
where the function { }, equals its argument. for positive arguments 
and is zero otherwise. 
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The combination of these two operations results in the desired in- 
terrupted signal 


1@= “0 (a(t) —c},. (3) 


The average off-time ratio depends on the magnitude of the constant 
c. For Gaussian signals, this ratio is given by 


Trott = 1 — exp [—c’/a’). (4) 


In one of the computer simulations, c was chosen equal to 0.5 a. 
For a Gaussian signal, this choice corresponds to 


c= ina". (5) 
Thus, the average off-time becomes 
Tore = 1 — exp [—7/16] = 0.18 = 18%. (6) 


The actually observed off-time ratios for c = 0.5 a for two test sentences 
were 26 percent for male speech and 16 percent for female speech. 
(a was obtained by averaging a(t) over 20 msec with a rectangular time 
window.) 

Tig. 1 shows microfilm outputs from a computer simulation. The 
constant c was chosen equal to 0.9 @ in order to achieve off-time ratios 
near 50 percent. The first line shows the original signal s(¢), the second 
line the interrupted signal s;(¢) and the third line the “switching func- 
tion” {1 — c/a(é)}, . The actual off-time ratio for the total utterance 
(one sentence, male speaker) was ro¢, = 55 percent. 

The speech quality for this rather large off-time ratio was Judged 
somewhat nasal but as intelligible as the original. It is possible that 


Fig. 1— (a) Original signal s(t). (b) Interrupted signal s:(£). (c) “Switching 
function” {1 — 0.9a(t)/a(t)},. 
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Fig. 2— Possible implementation of a system for interpolation of data with con- 
tinuous speech signals. 


even better results might be achieved by “smoothing” the switching 
function, by filling in short gaps and by eliminating short speech bursts. 

A possible implementation is shown in Fig. 2 in block diagram form. 
The division by a(t) indicated in (8) can be effected by single-sideband 
modulation followed by infinite clipping. The function {-}, corresponds 
to standard half-wave rectification. The multiplication by {a(t) — c}, 
can be effected by a switch-type modulator. The desired interrupted 
signal is obtained by a downward frequency shift of the modified- 
envelope single-sideband signal. 

The problem of interpolating signalling information with speech 
arose in a new mobile communication system for trains and was brought 
to our attention by Mr. C. E. Paul. 
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